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Innovation versus science? 


Harder economic times will force governments to ask tough questions about their investments in research. 


past 20 years or so. During the unusually long economic boom 

after the end of the cold war, governments of all political stripes 
have accepted the argument that it is in their interest to support ‘the 
best science. Something like a global consensus has emerged on the 
value of curiosity-driven, basic research. The predominant argument 
behind this consensus has been the belief that excellent science — of 
whatever discipline — is likely to spur innovation, which will in turn 
foster economic growth. 

At the same time, direct government sponsorship of technology 
development has fallen out of vogue. Considerable expenditure con- 
tinues, of course, and if the military sphere is included, it still dwarfs 
the resources devoted to basic science. But the idea of explicit state 
support for the development of drugs or circuit boards or civilian 
airliners has been pretty well driven off the table. There has been an 
assumption — even in comparatively centrist nations such as France 
and Japan — that governments aren't good at ‘picking winners: 

This particular era of science and technology policy may now be 
drawing to a close. Even before last week’s juddering stock markets 
sparked talk about a possible US, or even global, recession, policy- 
makers in industrialized countries were watching the flight of much 
industrial production to China and India with intensifying alarm. 

Asa result, the word ‘competitiveness’ is back on the agenda, par- 
ticularly in the United States. When political leaders look at research 
budgets in the light of competitiveness, they are always prone to be 
more drawn to the direct support of innovation through technology 
programmes, than to its indirect support, through basic science. As 
a result, they may start asking questions that are inherently difficult 
for scientists to answer, such as, what will be the economic spin-off 
from this work? What are we getting for our money? 

These questions are currently being posed most directly in the 
United Kingdom, where the research councils, which support most 
university science, seem to be undergoing a subtle change of direction. 
Some scientist groups are already nervous about a paper, “Increasing 
the economic impact of the research councils’, that was published in 
January by the councils’ steering group. 

There is a risk that this process could result in perceived economic 


2 cience and science funding have enjoyed a good run over the 


relevance displacing scientific merit, to a significant extent, as the 
determining factor in the selection of research-council grants. Before 
that happens, it would be reassuring if the leaders of the research 
councils emphatically reiterated that their primary function is to pro- 
mote scientific excellence — and that they will best support Britain's 
national interests, economic and otherwise, by doing exactly that. 

A similar trend is beginning to emerge in the United States. The 
competitiveness legislation passed by Congress and signed into law 
on 9 August is positive for science — 
it lays heavy emphasis on support- 
ing basic research in mathematics, 
the physical sciences and engineer- 
ing. Yet it is by no means certain 
that this approach will be reflected 
in the annual budget process. The 
Democrats, who now control both 
houses of Congress, have tradition- 
ally favoured technology programmes, sometimes over scientific 
ones, and that pattern could resume as fears about national com- 
petitiveness become more acute. 

The argument will then be made for different scientific pro- 
grammes to ‘prove themselves’ in answer to the taxpayers’ question: 
what have you done for us lately? It is important, in such an environ- 
ment, for scientists to hold their nerve. In particular, they should 
resist the inevitable demand that they start to compete with each 
other on the basis of specious metrics that bureaucrats, given half a 
chance, will construct as surrogates for economic impact. 

Straitened economic circumstances, whenever they arrive, will 
mean straitened times for science: that much the research community 
must be ready to accept. It can also anticipate attempts to cut and splice 
the pie on the basis of ungrounded expectations that some disciplines 
will yield economic dividends, and others wont. Researchers have at 
least two weapons that they should keep well honed: a compelling his- 
torical narrative showing the unpredictable paths from science across 
all disciplines to economic and other benefits; and a demonstration 
that those best-placed to innovate on the basis of science — and, in 
turn, to stimulate scientific ideas — are well set up to do so. | 


“Scientists should resist 
the demand to compete 
on the basis of specious 
metrics constructed as 
surrogates for economic 
impact.” 


Indentured labour 


The deal at the foot of the scientific totem pole 
remains a raw one. 


(FASEB) has just released a report on the career trajectories of 
young life scientists in the United States (see page 848). It is 
likely to give pause to some of those currently considering graduate 
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training as a route to a career in the academic life sciences. 

The survey finds that over two decades the number of academi- 
cally employed life scientists in tenured or tenure-track positions 
has remained stuck at about 30,000, while the number of doctoral 
degrees awarded in the life sciences has doubled. Thus the proportion 
of postdocs actually reaching tenured or tenure-track positions has 
dropped from nearly 45% in the early 1980s, to just below 30%. 

The data also reveal a hard-to-reach career getting farther out of 
reach. The age at which the average PhD holder receives his or her 
first full National Institutes of Health grant has risen from 34 in 1970 
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to 42 now. Postdocs, facing such a late start to their professional lives, 
are increasingly jumping ship to industry. 

Academic institutions rely heavily on graduate students and 
postdocs to bring in tuition or overhead funding and to carry their 
share of the teaching load. The motivation for principal investigators 
is even stronger. Students and postdocs carry out the day-to-day work 
in laboratories serving as cheap, well trained labour. Moreover the 
nature of discovery often seems to require big numbers: far better to 
have six postdocs working on several projects, in case one of them 
gets results that will ensure funding for the laboratory for years. 

This pattern has, of course, been familiar for years — and not just 
in the United States. Postdocs find themselves bouncing around 
the world from lab to lab, seldom earning much more than they 
would have done in their first year on the job market with their 
undergraduate degree. Funding is short, the hours are long, and 
prospects uncertain. 

Postdocs have occasionally attempted to band together in solidar- 
ity and seek a better settlement from their employers, the institu- 
tions and universities. But this movement has been stronger in the 
social sciences than in the hard sciences. The transient nature of the 


work, together with its convoluted employment structure, has made it 
difficult for them to speak effectively with a single voice. Instead, the 
plight of the postdoc will probably change only if the issue of scientific 
training is addressed from the top, where it 


may be necessary to consider the possibility “Funding is short, 
that too many scientists are being trained. the hours are long, 
There is an argument that, fromanational and prospects 


policy perspective, the current situation is 
ultimately productive. The pace of discovery 
is quickened by a sizeable workforce, and able scientists end up doing 
multiple jobs, most of them in the private sector of the economy. It 
might not be exactly what the students had in mind in the first place, 
but the situation hardly constitutes a major cause for concern. 

But FASEB’ data suggest that too many graduate schools may be pre- 
paring too many students, so that too few young scientists have a real 
prospect of making a career in academic science. More effort is needed 
to ensure that recruitment interviews include realistic assessments of 
prospective students’ expectations and potential in the academic work- 
place. And training should address broader career options from day 
one rather than focusing unrealistically on jobs that don't exist. a 


uncertain.” 


Technology trap 


California is right to sound a cautionary note on 
electronic voting. 


cient and secure may sound like an easy thing to do. And the 

pay-off — a democracy in which more people can participate 
and trust — seems desirable. But an academic analysis of three widely 
used systems in California has found monumental weaknesses in 
each of them. As a result, the state is slowing down its adoption of 
such systems until significant improvements are made. Others should 
exercise similar caution. 

The study, commissioned by California's secretary of state, Debra 
Bowen, was led by computer scientists at the Berkeley and Davis 
campuses of the University of California. It found that the systems 
sold by three companies — Sequoia Voting Systems, Hart InterCivic 
and Diebold — had not been designed with security requirements 
in mind. And one particular deficit alarmed representatives of all 
political parties: the possibility that computer viruses could distort 
vote counts. 

On 3 August, Bowen decertified the systems, which were already in 
use in counties where about half of the state’s voters live. That means 
that in the primary elections next February, voters will return to paper 
ballots. Bowen has pledged to fully recertify the machines when they 
comply with a list of basic requirements: but the study authors question 
whether the software and hardware are amenable to ready repair. “They 
have serious security problems that will take years to fix,” says David 
Wagner, a study leader at the University of California, Berkeley. 

This isrrt the first time that specialists have warned against elec- 
tronic voting systems. The Voting Technology Project, for example, 
a joint effort between the Massachusetts and California Institutes 


D esigning an electronic voting system that is easy to use, effi- 
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of Technology, highlighted their failings back in 2001 (see Nature 
412, 258; 2001). 

Yet the march of voting automation continues worldwide, often 
driven not by the public good but by election officials’ desire for low 
staff costs and quick counts — as well as by the marketing machines 
of the systems’ suppliers. Even in the United States, the Californian 
analysis is unlikely to make much of a difference in the many other 
states where the same electronic systems are being introduced. 
Verifiedvoting.org, a non-partisan lobby group that campaigns for 
reliable voting, says that although some secretaries of state are pay- 
ing attention to the study, others — especially in the south and the 
midwest — don't seem to be interested. 

There remains a body of public officials who seem to favour expe- 
diency and convenience over the democratic imperative of an accu- 
rate count. The firms that sell the systems have, meanwhile, argued 
that in the real world of elections, the systems will be overseen by 
election officials and candidates who would protect against the kind 
of disruptions identified in laboratory studies. 

After the scandal that unfolded in Florida in the 2000 presidential 
election, when President George W. Bush eked out a narrow victory 
after prolonged legal arguments over disputed ballots in several coun- 
ties, Congress passed a law that, among other things, helps to fund 
the replacement of existing, outmoded voting equipment. Now it is 
set to revisit the issue, with Senator Dianne Feinstein (Democrat, 
California) pledging to hold hearings that will pick up where the 
review in her own state left off. This may spur broader federal action 
to strengthen voting systems. 

The consistent message from studies of electronic voting systems 
has been that the technology is often being implemented before it 
is ready to achieve the levels of security and reliability that voters 
are entitled to expect. Other jurisdictions worldwide should follow 
California’s lead, consult with computer scientists, and act where nec- 
essary to stop this from happening. 7 


©2007 Nature Publishing Group 


nature 


RESEARCH AIGHLIGHTS 


The shadow of drought 
Geophys. Res. Lett. 34, L15702 (2007) 


The worldwide cloud of aerosols produced 
by the eruption of Mount Pinatubo in 1991 
(pictured) led to a remarkable slowing 
of Earth's hydrological cycle, according 
to researchers at the National Center for 
Atmospheric Research in Boulder, Colorado. 
Kevin Trenberth and Aiguo Dai show that 
1992 was deficient in both precipitation over 
land and streamflow to a degree unseen in 
55 years of records, probably as a result of 
reduced incoming solar radiation leading to 
reduced evaporation. The effect remained 
significant even when the impact of a near- 
concurrent El Nifio event was factored out. 
The authors warn that ‘geoengineering’ 
projects designed to offset greenhouse 
warming by reduing the amount of 
incoming solar radiation on a global scale 
may have similar consequences. 


CELL BIOLOGY 


Little helper 


Science 317, 957-961 and 961-964 (2007) 
Two studies offer molecular insight into 
how proteins get into and across cellular 
membranes. 

Both studies suggest that protein structures 
known as POTRA domains, found in the 
outer membranes of Gram-negative bacteria, 
mitochondria and chloroplasts, begin the job 
by binding membrane-destined proteins. 

Vincent Villeret at the Institute of Biology 
of Lille, France, Frangoise Jacob-Dubuisson 
at the Lille Pasteur Institute and their co- 
workers solved the structure of part of a 
membrane protein known as FhaC from 
Bordatella pertusis, which causes whooping 
cough. FhaC forms a channel that transports 
virulence proteins across the membrane, 
opening when a portion of one of its POTRA 
domains binds the virulence protein. 
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Daniel Kahne at Harvard University in 
Cambridge, Massachusetts, and his colleagues 
studied a fragment of a protein known as YaeT 
from Escherichia coli. It forms a scaffold for a 
complex that folds other membrane proteins. 
A portion of one of its POTRA domains 
seems to bind these proteins. 


CHEMISTRY 
Blinking hot 


J. Am. Chem. Soc. doi:10.1021/ja0715905 (2007) 
You can't use a medicine-cabinet 
thermometer in a microlitre of fluid. 
Scientists wanting to take the temperature of 
reagents in microfluidic devices might turn 
toa ‘blinking’ molecule instead. 

The proposal comes from Cécile Fradin 
and her team at McMaster University in 
Hamilton, Canada. They note that under laser 
illumination, enhanced green fluorescent 
protein switches between a glowing and a 
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dull state with the removal and addition ofa 
proton. How fast the protein ‘blinks’ depends 
on two properties of the solution to which it is 
added: acidity and temperature. 

Previous methods to gauge temperature 
with dyes have relied on changes in 
fluorescence intensity, which yield only 
relative measurements. This protein, at 
known pH, gives an absolute temperature 
measurement. 


CANCER BIOLOGY 


Tube travel 


Nature Cell Biol. 9, 893-904 (2007) 

With the help of time-lapse microscopy, 
researchers have revealed how tumours 
degrade and invade the protein matrix around 
healthy cells to spread through the body. 

Peter Fried] at the University of Wiirzburg 
in Germany and his colleagues studied 
fibrosarcoma and breast cancer cells as the 
cells invaded three-dimensional blocks of 
collagen in a lab dish. The team identified two 
key steps. First, single cells rearrange collagen 
fibres that are blocking their way into tube-like 
holes. Then, groups of cells behind the leading 
cells expand the tubes (pictured left), clearing 
away larger areas of the matrix with the help of 
enzymes including membrane-type-1 matrix 
metalloproteinase (MT1-MMP). 


SOLID-STATE PHYSICS 


Allfor one 


Nano Lett. 7, 2506-2512 (2007) 

In today’s silicon solar cells, which account for 
more than 90% of the solar-cell market, each 
absorbed photon produces a pair of charge 
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carriers of fixed energy. Any surplus energy 
from the original photon is lost as heat. 
However, in nanoparticles of certain 
semiconductors, such as lead selenide, a 
single photon that has enough energy to 
produce two or more pairs of charge carriers 
can do so. Now Matt Beard, Randy Ellingson, 
Arthur Nozik and other colleagues at the 
National Renewable Energy Laboratory 
in Golden, Colorado, show that the same 
can happen in silicon nanoparticles. This 
‘multiple exciton generation would give 
silicon-nanoparticle-based cells a theoretical 
peak efficiency of about 44%, compared with 
around 33% for traditional silicon solar cells. 
But the researchers have yet to work out 
how to make use of the charge carriers in 
such particles. 


GENETICS 


Red or white 


PloS Genet. 3, e133 (2007) 

Researchers have surveyed the genomes of 
440 rice strains to work out the evolutionary 
history of how wild rice lost its red colour to 
cultivation. 

Susan McCouch at Cornell University in 
Ithaca, New York, and her colleagues report 
that more than 97% of the white-grained 
rices they studied have the 14 base-pair 
deletion previously identified with the loss 
of the red outer layer, or pericarp, of wild rice 
grains. Other clues in the genome suggest this 
mutation arose once in japonica subspecies 
and then spread around the world, probably 
through human trade. It is thought that 
humans might have preferred white rice over 
red for its novelty, its softer shell, or because 
it was easier to pick out insects or dirt from 
ameal. A second mutation associated with 
white grains appeared in fewer than 3% of 
species studied. 


ECOLOGY 


Worm wood 


Conserv. Biol. 21, 997-1008 (2007) 
Traipsing through the sugar-maple forests 
of North America has become a different 
experience since alien earthworms invaded. 

Andrew Holdsworth and his team at the 
University of Minnesota in St Paul have 
examined various sites in two National 
Forests in the northern United States. They 
found a clear negative correlation between 
the presence of worms from the European 
genus Lumbricus (pictured right) and plant 
diversity, particularly the diversity of small 
non-grassy plants, or forbs. 

This confirms past observations 
of Lumbricus disrupting native plant 
assemblages. The suggested reason? The 
European worms eat leaf litter that would 
otherwise shelter seeds and host nutrient- 
providing fungi. Asa result, species such as 
wild sarsaparilla (Aralia nudicaulis L.) and 
rose twisted stalk (Streptopus roseus Michx.) 
disappear. 


ASTRONOMY 


Chemical misfit 


Astrophys. J. 665, L127-L130 (2007) 
Propene has been detected in space for 
the first time, in a dark interstellar cloud 
in the constellation of Taurus. No one had 
previously looked for the molecule (C;H,) 
because, although such clouds can be rich in 
hydrocarbons, there seemed to be no obvious 
mechanism by which propene would form. 
Nuria Marcelino at Instituto de Estructura 
de la Materia in Madrid, Spain, and her 
colleagues made the discovery with the 
IRAM 30-metre radio telescope near 
Granada. Marcalino’s team suggests that 
now they have found propene, interstellar 
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chemistry 
models will need 

to be revisited, with 
more attention paid 
to small-hydrocarbon 
chemistry. 


QUANTUM PHYSICS 


Massive interference 


Nature Phys. doi:10.1038/nphys701 (2007) 
Shooting a beam of electrons, atoms or 
molecules through a grating is the classic way 
to reveal the particles’ quantum wave-like 
nature. The beam produces an interference 
pattern, just as light would. 

Markus Arndt of the University of Vienna, 
Austria, and his colleagues report a nifty way 
to extend this technique to larger molecules 
than it has been possible to study so far. 

Large molecules tend to interact with a 
grating, blurring the interference pattern. 
Arndt’s team overcome this problem with 
a clever arrangement of three gratings 
— the second made not of material, but of 
a ‘standing wave’ of light. The properties of 
the light barrier can be tuned to optimize 
the contrast of the interference image, as the 
authors demonstrate for ”C — a fullerene 
— and azobenzene molecules. 


JOURNAL CLUB 


David Stevenson 
California Institute of 
Technology, USA 


Aplanetary scientist foresees a 
shift in the debate about Earth's 
heat flow. 


Measurements of the heat coming 
out of Earth’s interior have long 
posed a puzzle for understanding 
the planet's history. 

Earth's heat output is estimated 
to be around 44 terawatts, 
about twice that expected from 
radioactive decay. The difference 


can be attributed to cooling of the 
deep Earth, implying a present- 
day cooling rate of 100 kelvin per 
billion years. But simple models 
with this much cooling ‘blow up’ 
when they are run back in time, 
predicting ridiculous temperatures 
for the early Earth. Acceptable 
models rely on unconventional 
deviations from the usual simple 
scaling laws for mantle convection. 
This is an attractive but untested 
idea. 

land many others have 
wondered whether an alternative 
explanation is that today's heat 
flow is higher than the average for 


the past half a billion years. Such 
fluctuations could arise as a result 
of the dispersal and accumulation 
of continental land masses. 

A recent paper (J. Korenaga 
Earth Planet. Sci. Lett. 257, 
350-358; 2007) assessed this 
possibility by taking advantage of 
a long-known connection between 
sea level and the heat flow from 
sea-floor spreading. It finds little 
room for more than a few percent 
fluctuation in heat flow around its 
long-term decline. 

| think this pushes the problem 
back into the realm of models, 
focusing attention on plate 
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tectonics, the deep water cycle 
(because water affects how rocks 
flow), and perhaps even the long- 
standing question of whether 
Earth's mantle is well mixed from 
top to bottom. 

On a decadal timescale, we 
can hope that better 
measurements of heat generation 
and flow will be combined with 
more realistic theory. Like many 
central Earth science questions, 
the heat-flow problem resists 
quick resolution. 


Discuss this paper at http://blogs. 


nature.com/nature/journalclub 
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NEWS 


Oldest gorilla ages our joint ancestor 


The last common ancestor of humans and 
gorillas might have lived at least 2 million 
years earlier than previously thought. Fossil- 
ized teeth of the earliest gorilla ever discov- 
ered, dating to 10 million years ago, have been 
found in Africa, say researchers. 

The new species (Chororapithecus abyssini- 
cus) from Ethiopia, reported on page 921 of 
this issue, helps to fill in a huge gap in the fossil 
record. The team of Ethiopian and 


Japanese researchers has based its “Africawasthe determine the hom- 

conclusion on just nine teeth from place of origin inin line's divergence 
at least three individuals of the spe- from chimps, previ- 
cies, which were discovered in the of both humans ously pegged at about 
desert scrubland of Afar about 170 and modern 6 million years ago. 


kilometres east of Addis Ababa. 
The teeth, eight molars and a 
canine, “are collectively indistin- 
guishable from modern gorilla subspecies” in 
size, proportion and scan-revealed internal 
structure, says Gen Suwa of the University 
of Tokyo Museum, Japan, who led the study. 
The team argues that the gorilla’s divergence 
date from the human lineage is not about 8 
million years ago as previously surmised (S. 
Kumar et al. Proc. Natl Acad. Sci. USA 102, 
18842-18847; 2005), but “greater than 10 to 
11 million years ago” on the basis of the age of 


African apes.” 


the new species. Functionally, 
he adds, the teeth already seem 
to be evolving — they could 
shear through a plant diet, a 
gorilla trait — although other 
herbivore apes also exist in the 
fossil record. 
This finding could prompt 
discussions of how anthropolo- 
gists and geneticists 


“Chororapithecus abys- 
sinicus suggests, once 
again, that Africa was 
the place of origin of both humans 
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Eurasia as some researchers have 
argued, says Suwa. 

But palaeoanthropologist Jay Kelley, who 
studies primate teeth at the University of IIli- 
nois at Chicago and was not involved in the 
study, is sceptical. “I’m not convinced it is a 
gorilla,” he says. More fossils, analysis and 
debate will be needed to determine whether 


the specimen is ancestral to hominids, he adds. 
For now, he would be “very cautious” about 
using the specimen to realign divergence dates 
between hominins and gorillas—chimps. 
Suwa’ team is part of the Revealing Human 
Origins Initiative (RHOJ), a project that 
searches at multiple sites in Africa, Europe 


Ocean circulation nolsy, not stalling 


Suggestions of a 
substantial decline in the 
Atlantic Ocean circulation 
that carries warm tropical 
water northwards seem 
to be largely unfounded. 
New data — recorded 
between the Bahamas 
and the Canary Islands 
— show that the seemingly 
dramatic reduction 
discovered two years ago in 
the strength of the Atlantic 
meridional overturning 
circulation (MOC) is easily 
within the range of huge 
seasonal variability. 

In 2005, ateam 
led by Harry Bryden, 
an oceanographer 
at the University of 
Southampton, UK, 
reported a 30% decrease 
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in the MOC at 26.5° 
north, after comparing 
their measurements from 
ship-based instruments 
in 2004 with similar data 
from 1957, 1981, 1992 
and 1998 (H. L. Bryden, 
H. R. Longworth and S. A. 
Cunningham Nature 438, 
655-657; 2005). They 
did caution at the time 
that, due to the scarcity 
of observations, the error 
bounds of their study were 
exceptionally wide. 
Nevertheless, the 
finding fuelled fears of 
an imminent collapse 
of the thermohaline 
Atlantic circulation — the 
ocean conveyor driven by 
temperature and salinity 
differences that gives 


rise to the Gulf Stream, 
allowing western Europe 

to enjoy a relatively mild 
climate. Alarmist headlines 
warned that a mini ice age 
would hit Britain. 

However, 12 months’ 
worth of data from an array 
of moored instruments, 
deployed during the 
same 2004 cruise that 
yielded the most recent 
measurements included 
in Bryden's study, now 
suggest that the observed 
changes were due to short- 
term variability and not the 
result of global warming. 

Stuart Cunningham 
of the National 
Oceanography Centre 
in Southampton and his 
colleagues found that the 
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During just one year, Stuart Cunningham's team 


found enormous variability in the northwards flow of 
thermocline layer circulation (red), the deep waters of 
the upper (light blue), and the lower (dark blue) North 
Atlantic. Even the flow of intermediate water (green) 


fluctuated. 
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Fossilized teeth from the earliest-found gorilla (top, left), discovered at a site in Ethiopia 
(above), are remarkably similar to those of a modern gorilla (bottom, left). 


and Asia for species that predate the 


earliest known hominid, the 7-million-year- 
old Sahelanthropus tchadensis (M. Brunet et al. 
Nature 418, 145-151; 2002). 

Between 15 million and 22 million years 
ago, there were dozens of primate species 
across Africa and Euroasia — and apes domi- 
nated the primate scene. But fossils show that 
these species don’t share the characteristics 


of modern African apes. “From that species 
pool, the common ancestor of African apes 
and humans branched out,’ says Tim White, 
a palaeoanthropologist at the University of 
California, Berkeley, and an RHOI director. 
“And the goal of RHOI is to find the common 
ancestors. With C. abyssinicus, we now can see 
an ancestral African ape.’ 

Rex Dalton 


strength of the overturning 
circulation varied wildly 
— bya factor of 8 — between 
March 2004 and March 
2005 (S. A. Cunningham 
et al. Science 317, 935-937; 
2007). Bryden, who was also 
involved in the new study, had 
unknowingly probed the ocean 
during a period of relatively 
low circulation, they conclude. 
“| guess we can give 
[thermohaline] the all-clear," 
says Jochem Marotzke, 
an oceanographer at the 
Max Planck Institute for 
Meteorology in Hamburg, 
Germany, formerly principal 
investigator of Rapid Climate 
Change (RAPID), a monitoring 
programme funded by the UK 
Natural Environment Research 
Council. The Atlantic sensor 
array, which is to remain 
operational until 2014, is part 
of the RAPID programme. 


Available data provide no 
statistically significant 
evidence for a change of 

the overturning circulation 
over the past 50 years, says 
Marotzke, a co-author on the 
new study. 

From now, the RAPID 
system should allow 
researchers to detect 
circulation changes from one 
year to the next, provided 
the average circulation 
volume changes by 20% 
or more. But because the 
unknown variability from 
year to year may also be 
large, it will take at least ten 
more years of continuous 
measurements until a possible 
downward trend will become 
recognizable in the data noise. 

But Bryden remains 
unconvinced that the RAPID 
measurements in the new 
study have rendered his 


initial findings null and void. 
The depth of the thermocline 
— the layer of rapid heat 
transfer between the mixed- 
water layer at the surface and 
deep-ocean water — varies 
across the basin from west 
to mid-ocean, changing the 
recirculation of water. An 
increase in the slope of the 
thermocline along with the 
observed freshening of deep 
water near the Bahamas, do 
indicate a 10-15% decline 
in average circulation since 
1980, Bryden maintains. 
“RAPID is quite an 
achievement,” Bryden says. 
“But | don’t share the opinion 
that our previous results 
are just due to seasonal 
variability. It may not be 
statistically reliable, but all the 
science points to a slow down 
of the Atlantic circulation.” 
Quirin Schiermeier 
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“In our courses you 
will have the possibility 
to renew the natural 
contact you have with 
the angels. Use angels 
and your own power to 
create miracles in your 
own life.>> 


Norway's Princess Martha Louise's 
school for communicating with 
angels opens on 22 August and 
courses are already sold out. But 
TV pastor Jan Hanvold has unkindly 
called the clairvoyant princess 

“an emissary from hell”. Sidelines 
wonders who the angels prefer to 
talk to. 


Left-clawed crabs 

You'd think that being a 

left-clawed Australian 
fiddler crab when 98% are 
right-clawed would give you an 
advantage ina fight. But it 
turns out that left-clawed 
crabs are less likely to fight, 
and when they do, = 
they usually 
lose. Perhaps 
that's why 
they are so 
much rarer... 


Ghost crabs 

Ina study involving off- 

road vehicles on sandy 
Australian beaches scientists 
discovered that no ghost crabs 
buried less than 5 cm deep 
survive being run over 10 times, 
while more than 70% survive 
if buried at depths of 20 cm. 
Someone should tell the crabs. 


M. JENNIONS 


is the latest estimate of the 
maximum number of moves 
needed to solve any Rubik's cube. 


is the time a 
supercomputer took to whittle 
down to this number from a best 
estimate of 29 moves. 


is the 
theoretical estimate of the 
maximum number of moves ever 
needed — a number still unproven. 


is the 
total number of Rubik's cube 
positions. 


Sources: BBC, Spiegel, Proc. R. Soc. B, 
Mar. Ecol. 
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Fleece myth hints at 
golden age for Georgia 


The legend of Jason and the Argonauts’ heroic 
quest for the Golden Fleece may provide a fas- 
cinating window into real Bronze Age society, 
say researchers who believe they have found 
the earliest evidence of a trading gold mine. 
The adventure, which was set about 
1300 Bc —a generation before the Trojan war 
described by Homer — involves a mission 
to capture the precious fleece 
from Colchis, in modern- 
day Georgia. The myth- 
ological fleece has been 
interpreted in various 
ways, but is thought by 
some to symbolize 
the vast wealth and 
gold of Georgia. 
Although gold was used and 
traded in ancient times, there has been scant 
evidence of where the precious metal used 
in Eurasia during the Bronze Age originated, 
and whether it was mined or panned from 
rivers. Gold mines in the Upper Nile region 
of Egypt and elsewhere in Africa date from 
2,500 Bc and are believed to have supplied the 
Ancient Egyptians’ needs. But the legend of the 
Golden Fleece hints at a thriving gold trade in 


the Caucasus dating back further than was first 
thought. 

In 2004, a team of German and Georgian 
archaeologists discovered what may be the 
world’s oldest gold mine at Sakdrissi, about 
70 kilometres southwest 

of Tbilisi. Thomas 
Stdllner and Andreas 
: Hauptmann of Ruhr 
‘University in Bochum, 
Germany, and Irina 
Gambashidze, an archaeologist 
from the Centre of Archaeolog- 
ical Studies in Tbilisi, dated the 
mine to around 3000 Bc. 

Now, the researchers have 

produced evidence that suggests 
this mine was trading as far back as 2500 Bc. 
The team analysed samples of gold from the 
mine and compared them with gold from 
some 25 ancient artefacts dating from 2500 
to 1500 Bc in the Bronze Age. The beautifully 
worked pieces, housed in museum collections 
in Georgia and neighbouring countries, include 
needles, necklaces and rings. The gold was ana- 
lysed using a process called LA-ICP-MS (laser 
ablation-inductively coupled plasma—mass 


Analysis of Bronze Age artefacts, such as the lion 
(left), offers historical links to the Golden Fleece. 


spectrometry), in which a tiny layer of surface 
metal is vaporized and the various elements in 
the vapour are identified by their mass. 


Journal presents a mathematical conundrum 


A branch of mathematics known 
as K-theory has this month seen 
its flagship journal bifurcate ina 
strange dispute, the details of which 
were still emerging as Nature went 
to press. 

The events centre on Anthony Bak, 
a mathematician at the University 
of Bielefeld in Germany and until 
recently managing editor of the 
Springer journal K-Theory. Bak left 
the monthly journal, due, he says, to 
production problems and a too-high 
subscription rate. He has arranged 
to launch a rival title with another 
publisher that has offered a lower 
subscription rate, in the process 
shedding some editors unhappy at 
how the shift has been managed. 

In seeking to cut prices, the 
editors follow the lead of the 
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editorial board of the Elsevier 
mathematics journal Topology, 
which quit to launch a competing 
non-profit title (see Nature 445, 
351; 2007). 

The cover date of the last issue of 
the supposedly monthly K-Theory 
was April 2006. Bak has withheld 
all papers accepted by the journal 
since then — a situation that not 
all of the editors, or the papers’ 
authors, seem to have been aware 
of until recently. 

In January 2007, the publisher 
sacked Bak. That month Bak wrote 
to his editorial board members 
asking them to resign en masse, 
which they did. As well as the 
cost of the journal, he mentioned 
production problems at Springer. But 
he neglected to tell Springer of the 


resignations. “At no point did Bak 
communicate to Springer directly 
that the board had resigned,” says 
Catriona Byrne, editorial director for 
mathematics at Springer. 

It was only in May that Springer 


learned that the board had resigned. 


In August, Bak announced publicly 
that he had arranged to launcha 
rival title, the Journal of K-Theory, 
with Cambridge University Press at 
a lower subscription rate. Springer 
has now said that it will lower the 
K-Theory subscription rate. 

Eric Friedlander at Northwestern 
University in Evanston, Illinois, 
a former editor at K-Theory, is in 
principle sympathetic to the switch. 
“There is a lot of concern in the 
mathematics community about the 
cost of journals,” he says. But on 
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17 August, Friedlander wrote to Bak 
to say that despite being named as 
amember of the editorial board of 
the new journal, he was not willing 
to serve “because | cannot endorse 
the process by which you have 
withheld manuscripts submitted 
to K-Theory and proceeded without 
consultation with authors and the 
editorial board”. 

One of the editors who resigned, 
Andrew Ranicki of the University 
of Edinburgh, UK, has returned 
to the Springer journal. He and 
mathematician Wolfgang Liick of 
the University of Miinster, Germany, 
issued a statement on 13 August to 
say that they would act as interim 
managing editors for the journal. 

Bak launched K-Theory in the 
1980s and ran it successfully 


NEWS 


The ratio of trace elements in the gold helps 
to pinpoint where it came from. In this case, the 
team found similar levels of lead and osmium 
in the gold from the mine in Sakdrissi as in 
the artefacts. But the clincher was the unusu- 
ally high amount of antimony in the artefacts, 


which is characteristic of the particular geo- 
chemistry at Sakdrissi. 

Although this suggests a strong link between 
the mine and the artefacts, it is possible that the 
gold used was not mined, but panned from riv- 
ers in the local area. Villagers in parts of Georgia 
are still reported to pan the rivers for gold using 
sheep fleeces — another potential link to the tale 
of the Golden Fleece. Gold obtained from rivers 
tends to contain elevated levels of metals such 
as osmium, tin and palladium, so the archae- 
ologists are now assessing the osmium levels 
in gold taken from the river to see whether the 
gold in the artefacts was panned or mined. 

Fingerprinting gold is one of the most 
difficult tasks in archaeometry, says Maria- 
Filomena Guerra, a physicist at the Centre for 
Research and Restoration of the Museums of 
France in Paris, who is separately trying to fol- 
low the gold trail. Guerra’s team has been ana- 
lysing gold artefacts from a Mycenaean tomb 
at Kazanaki near Volos, the modern Greek 
town close to the area where legend says Jason 
came from. The tomb dates from 1400 Bc 
and Guerra has so far analysed a gold-beaded 
necklace and gold leaf interred there, using a 
technique called PIXE XRF (particle-induced 
X-ray emission X-ray fluorescence). 

Guerra found levels of tin, platinum and pal- 
ladium in the tomb samples that indicate the 
gold came from a river source and was almost 
certainly panned for, she says. She now plans 
further analyses to prove her hunch that the 
gold was panned in Georgia. ] 
Emiliano Feresin 


for many years. He became 
increasingly unhappy after 
Springer acquired the journal 
through its merger with Kluwer 
Academic Publishers in 2004. 
Bak says that the editorial 
board agreed in 2004 to 


"There is a lot of 
concern in the 
mathematics 
community about 
the cost of journals” 


consider moving to a different 
publisher. 

Byrne admits there were 
some teething problems, but 
says these were being or had 
been dealt with. She adds that 
Bak failed to send sufficient 
manuscripts to fill the journal 
and that those he did send 


arrived erratically, creating 
difficulties with the workflow. 
Bak confirms that after April 
2006 he stopped sending 
Springer the manuscripts 
passed on to him by K-Theory's 
other editors. 

Friedlander is uncomfortable 
that papers were held up: “Our 
responsibility is to review 
mathematics that is submitted 
to us and disseminate it.” Liick 
calls it "a scandal". 

Another former editor says 
they understood the backlog 
at Springer to have been so 
bad that the withholding of 
the papers would have made 
little difference to when they 
appeared. Byrne says: “This is 
definitely a misconception." 

Liick and Ranicki have asked 
authors to contact them if 
they have papers accepted 


by K-Theory that they want to 
be published in the journal. 
Meanwhile, Bak has offered 
authors publication in the new 
Journal of K-Theory. 

K-theory, which is concerned 
with the algebraic properties 
of space, has had some 
high-profile moments. One 
major contributor to the field, 
Hyman Bass of the University 
of Michigan, received a US 
National Medal of Science on 
27 July. But K-theory remains 
arelatively small discipline, 
which does not need more than 
one dedicated journal. 

For this reason, Ranicki and 
Liick said on 17 August that 
they will not try to continue 
K-Theory, but deal only with the 
papers that have already been 
accepted. | 
Jenny Hogan 
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More biologists but tenure stays static 


It’s official: biology postgraduates in the United 
States face greater competition for tenure than 
ever before. A wealth of data released this month 
will reopen discussions about employment and 
training in the US biomedical system. 

The data, compiled by the Federation of 
American Societies for Experimental Biology 
(FASEB) (opa.faseb.org/pages/Policylssues/ 
training_datappt.htm), are from many sources, 
including the US National Science Foundation 
(NSB), the Council of Graduate Schools and the 
National Institutes of Health (NIH). And one 
message is clear: increasing numbers of bright 
young students are eager for a career in biology 
and biomedicine, but fewer than before will gain 
the coveted tenured academic positions. 

NSF data show that the number of students in 
US graduate programmes in the biological sci- 
ences has increased steadily since 1966. In 2005, 
around 7,000 graduates earned a doctorate. But 
the number of biomedical PhDs with academic 
tenure has remained steady since 1981, at just 
over 20,000. During that period the percentage 
of US biomedical PhDs with tenure or tenure- 
track jobs dropped from nearly 45% to just 
below 30%. 

So where do all the graduate students end up? 
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The percentage of biomedical PhDs going into 
industry has tripled, from 10% to 30%, since the 
1970s, the NSF reports. But those who stay on 
the academic track face a more arduous slog than 
their mentors. Although numbers of applicants 


(eee Applications 


Year 


—A— Success rate 


for postdoctoral fellowships awarded by the NIH 
increased between 2002 and 2006, the percent- 
age who were successful dropped sharply (see 
graphic above). And the average age of scientists 
earning their first RO1 grant — the NIH’s bread- 


Is baby DVD research Mickey Mouse science? 


The Walt Disney Company is going 
toe-to-toe with the University of 
Washington in Seattle after a study 
by the university's researchers 
suggested that exposure to DVDs 
and videos for babies, including 
Disney's Baby Einstein, could be 
associated with poorer language 
development. 

One of the team, professor of 
paediatrics Dimitri Christakis, was 
widely quoted as saying “I would 
rather babies watch American Idol 
than these videos.” 

Robert Iger, Disney's chief 
executive, says the study's 
“methodology is doubtful, its data 
seem anomalous and the inferences 
it posits unreliable”. Ina letter 
to the university, he accused it of 
issuing a “deliberately misleading, 
irresponsible and derogatory" press 
release. “Whether your university 
is comfortable associating its name 
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with analysis of this quality is, of 
course, your decision,” he wrote. 
“And | would not be reaching out 
to you if all that was at stake was a 
poorly done academic study.” 
Whatis at stake is a million- 
dollar industry in such products for 
babies. A full set of Baby Einstein 
DVDs costs US$369.99. Baby 
Einstein packaging says it “is not 
designed to make babies smarter", 
but detractors claim such products 
are marketed as educational. 
“Disney is expecting the brand 
to bring in $1 billion by 2010,” 
says psychologist Susan Linn of 
Harvard Medical School in Boston, 
Massachusetts, and a founder of 
the Boston-based Campaign for 
a Commercial-Free Childhood, a 
group opposed to marketing to 
children. In May 2006 the campaign 
asked the US Federal Trade 
Commission (FTC) to investigate 


Baby Einstein and Brainy Baby 

of Alpharetta, Georgia, another 
leading manufacturer of baby 
videos, for “engaging in deceptive 
acts and practices”. They were 
backed by the American Academy 
of Pediatrics and the American 
Academy of Child and Adolescent 
Psychiatry. The complaint is still 
being considered. 

Linn sees the entire ‘baby 
industry’ becoming more litigious. 
“We can expect more of this kind of 
corporate intimidation,” she warns. 
“Disney is on the defensive and 
they're going to come out swinging." 

The study, published online 
earlier this month (F. Zimmerman 
et al. J. Pediatr. doi:10.1016/ 
j.jpeds.2007.04.071; 2007), found 
that babies aged 8 to 16 months 
who watched such videos scored 
lower than other babies on the 
Communicative Development 
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Inventory (CDI), a standard 

tool used to gauge language 
development in infants. Babies that 
watched an hour a day scored 17 
points lower on the CDI scale — 
corresponding to knowing seven 
fewer words than a typical baby in 
the study who did not watch the 
videos, the researchers say. 

Lead author Frederick 
Zimmerman suggests several 
explanations for their findings, 
including the fact that parents 
worried about their child's language 
development might turn to the 
videos. But “it is possible that heavy 
viewing of baby DVDs/videos has a 
deleterious effect on early language 
development,” he says. 

The study was press-released 
by his university under the 
headline “Baby DVDs, videos may 
hinder, not help, infants’ language 
development”. University president 
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and-butter grant to an independent researcher — 
has risen from 34 in 1970 to 42 now. 

The percentage of PhDs still in a postdoctoral 
fellowship three or four years after their doctor- 
ate has declined since 1997, from 45% to 30%, 
although the total number of postdocs grew from 
about 25,000 to 33,000 in the same period. 

What does this mean for biology and bio- 
medicine as a career option? It's more than an 
abstract question to Howard Garrison, direc- 
tor of FASEB’s office of public affairs, who has a 
college-bound daughter. “She's thinking of bio- 
logical sciences, so I tell her don't give up, but 
make sure youre realistic about your future,” 
Garrison says. “People are drawn to the biologi- 
cal sciences because it’s an exciting field and an 
exciting time, but people have to have a broad 
and flexible approach to their careers.” 

A huge question is why the doubling of the 


NIH budget from 1998 to 2003 seems not to 
have helped young scientists. Most of the money 
went into infrastructure rather than tenure-track 
jobs (see Nature 443, 894; 2006). The NIH dol- 
lars devoted to training fellowships did grow, but 
notas fast as the rest of its budget, and when the 
NIH budget stopped growing, so did the dollars 
(see graphic on left). 

Most NIH-supported postdoc appointments 
are now financed by research grants, not training 
grants. “If youre being supported to do research 
on research grants, but you're still a postdoc, 
there's a tension, because youre not being funded 
to be trained,” says Jodi Lubetsky of the Associa- 
tion of American Medical Colleges. 

Norka Ruiz Bravo, deputy director for extra- 
mural research at the NIH, says the agency wel- 
comes the FASEB report: “FASEB has performed 
avery useful and timely service for the biomedi- 
cal research community in highlighting this 
important issue. It is a matter of great interest and 
concern for NIH” The NIH recently instituted 
the Pathways to Independence awards, which 
help postdocs set up their own labs. 

A posting to an online careers discussion 
group puts the matter bluntly: “If you aren't 
thinking about ‘alternative careers’ before ever 
setting foot in graduate school, then you're 
being foolish” a 
Erika Check 
See Editorial, page 839. 


What babies watch: it may 
not make a difference. 


Mark Emmert has refused to 
retract the press release and 
says he stands behind the 

research. “The findings were 


considered significant enough 
to be reported in a major journal, 
and as a public institution we feel 
duty bound to make the public 


aware of these findings,” he says. 
Deborah Linebarger, an 
expert in child development 
and television at the University 
of Pennsylvania in Philadelphia, 
was asked by Disney to 
defend Baby Einstein to the 
FTC. Although believing such 
products can be useful, she 
declined. “I have concerns that 
anything called Baby Einstein, 
Genius, etcetera, is exploitive 
of a vulnerable 
population,” 
she says. 
Despite 
having “some 
methodological 
issues” with the 
paper, she says: 
“There are some 
valid conclusions 
in it that warrant 
additional 
research. I'm cautious, but it 


Daniel Cressey 
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makes sense.” | 
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Let down by 
the statistics 


Most claims that men and women are 
affected differently by disease-associated 
gene variations are poorly founded. A 
team of researchers has found that the 
data supporting such claims are often 
poorly analysed statistically or come 
from studies that were not adequately 
designed to show these links. 

“The abysmal standard of statistical 
analysis in much of genetic epidemiology 
is little short of scandalous,” says David 
Balding, professor of statistical genetics 
at Imperial College London, UK, who 
was not involved in the study. “This 
paper reveals an entire industry of 
prominently reported results that 
are largely unjustified and probably 
mostly false.” 

John Ioannidis and his colleagues at 
the University of Ioannina School of 
Medicine in Greece evaluated 432 claims 
in 77 research papers (N. Patsopoulos 
etal. J. Am. Med. Assoc. 298, 880-893; 
2007). The team applied a set of criteria 
to determine whether the papers’ authors 
had performed the correct analysis, such 
as comparing like with like, and had 
taken steps to show that the association 
was not due to chance. Worryingly, 
only 12.7% of claims satisfied these 
criteria. “There is quite a gap between 
what should have been done and what 
the journals and reviewers should have 
asked for, compared with what the 
authors did,” says Ioannidis. 

Many studies were not designed to 
test for a link between sex and gene 
variants, with researchers trying to 
extract associations from their data 
after the fact. Sample sizes were at least 
ten times smaller than they needed to 
be to yield statistically robust results, 
Ioannidis adds. 

“This paper tells us that we don’t have 
a clue whether gender is a real biomarker 
for any of the clinical areas assessed,” 
says Howard McLeod, director of the 
UNC Institute for Pharmacogenomics 
and Individualized Therapy in Chapel 
Hill, North Carolina. “Gender, as well 
as age and race, are crude ways of 
understanding the complex factors 
regulating clinical effect,” he adds. a 
Claire Ainsworth 
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Hope for axed cancer-prevention trial 


Researchers behind a major trial on breast-can- 
cer prevention whose funding was cancelled in 
June say that Elias Zerhouni, director of the US 
National Institutes of Health (NIH) has indi- 
cated that he may be willing to meet them. 

The trial was to have received US$55 mil- 
lion from the National Cancer Institute (NCI). 
It would have involved almost 13,000 post- 
menopausal women at a high risk of devel- 
oping breast cancer, and aimed to investigate 
the efficacy of two cancer drugs — letrozole 
and raloxifene — in preventing the disease. It 
was initially approved by NCI staff, external 
reviewers and, in January, by an executive com- 
mittee of senior NCI leaders. But the next day, 
NCI director John Niederhuber put the fund- 
ing on hold. He convened an ad hoc committee 
to study the costly trial further. 

Niederhuber later cancelled the funding, 
partly over fears of giving healthy women pills 
with known side effects and the public’s reluc- 
tance to take preventative medicines. Previous 


Drugs to prevent breast cancer: worth the cost? 


studies, he wrote, “have failed to change the 
practice of breast cancer prevention among 
women and their [doctors], perhaps because 
of the side effects of these drugs”. 

The scientists leading the trial at the National 
Surgical Adjuvant Breast and Bowel Project, 
based in Pittsburgh, Pennsylvania, protested 
vehemently, and accused Niederhuber of “acting 


unilaterally”. On 20 July, they hand-delivered 
a 13-page appeal to Zerhouni, asking him to 
intervene. Last week they said, “we've had some 
very positive interactions” with Zerhouni’s 
office, and they are hopeful of meeting him 
this month. 

The appeal has drawn media attention in 
the United States, and in a recent editorial The 
Lancet labelled Niederhuber’s actions “trou- 
bling” and called for an investigation (Lancet 
369, 2134; 2007). 

But Bruce Chabner of Massachusetts General 
Hospital, who chaired Niederhuber’s ad hoc 
committee, says that the NIH’s frozen budget 
has forced Niederhuber to make tough fiscal 
choices. He adds that the panel was strongly 
influenced by the fact that letrozole, made by 
Novartis, goes off-patent in 2011. “Was it worth 
spending millions on a trial that might never 
result in a drug being used for this indication?” 
he asks. “We thought it likely it wasn’t.” a 
Meredith Wadman 
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Field trials for GM food 
get green light in India 


India’s first genetically modified (GM) food 
crop is a step closer to reaching the dining 
table. The government has approved field 
trials for a strain of brinjal (aubergine) 
carrying a Bt (Bacillus thuringiensis) gene. 
Jalna-based Mahyco, the Indian venture 
of US seed giant Monsanto, claims its insect- 
resistant variety gives better yields with 
less pesticide use. To avoid possible cross- 
contamination with farmers’ crops, the trials 
will be carried out in government farms. 
But critics already campaigning against Bt 
cotton — currently the only GM crop grown 
in India — say the brinjal trials are illegal. 
Full biosafety data on the brinjal tests 
have not yet been generated, says Kavitha 
Kuruganti of the Centre for Sustainable 
Agriculture, based in Hyderabad: “The trials 
would open doors for entry of GM tomato, 
okra, mustard, corn, rice and five other food 
crops in the pipeline.” The government says 
that a committee of experts reviewed all 
available biosafety data and concluded they 
were in order. 


Gates Foundation helps 
Iraqis escape the conflict 


The Bill & Melinda Gates Foundation 

is getting into the business of helping 
persecuted Iraqi academics continue their 
research abroad. A $5-million grant will 
help support the Scholar Rescue Fund run 
by the Institute for International Education 
(ITE), which aims to raise $15 million for 
Iraq. 

The Gates Foundation is best known for 
its $33-billion bankroll and for tackling 
health problems such as malaria and HIV. 
The new grant is a one-time award, nota 
shift in focus, says a spokesperson for the 
foundation. 

As Iraqi professors come under increasing 
threat of violence, many have sought 
help to temporarily flee the country, says 
IIE president Allan Goodman. So far, 41 
grants have been awarded to Iraqis — 27 to 


Car-bomb wreckage near Baghdad University 
highlights the perils for some Iraqi academics. 


Asthmatics win payment ind 


Asthma patients in Tokyo last a 
week welcomed a cash settlement 4 
from car manufacturers and 

the Japanese government. The 
one-time payment resolved a 
decade-long legal battle in which 
the asthmatics blamed diesel car 
fumes for their illness. 

The automakers, including 
Toyota, Honda and Nissan, will 
provide ¥1.2 billion (US$10.5 
million) to the plaintiffs, anda 
further ¥3.3 billion to support 
a five-year health plan for the 
patients. The central government 
and Tokyo metropolitan 


iesel-fumes lawsuit 
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government will each contribute ¥6 billion to the medical programme. 


scientists — to pay for relocation, housing 
and research. “This is about the rescue of 
science and learning,’ says Goodman. 


WWE sues over threat to 
endangered whales 


Last month, in the Sea of Okhotsk, Russia, 
a conglomerate of energy firms lowered 

a 28,000-tonne platform for drilling for 
oil on to a base on the sea floor. They did 
it very carefully — in part to keep down 
the noise, as the last feeding grounds of 
the western Pacific population of grey 
whales (Eschrichtius robustus) lie some 

7 kilometres away. 

But environmentalists and some 
scientists say construction of the platform 
was not quiet enough. So the environmental 
group WWE and activist group the Corner 
House have taken legal action against a 
UK government department. The WWF 
claims that the department conditionally 
promised financial support for part of the 
Sakhalin-II project without approving its 
environmental impact. 

A department spokesperson calls the 
case “flimsy” and points out that the 
environmental review was in fact one of the 
conditions that would have to be met for 
approval. It is estimated that the Okhotsk 
region harbours the equivalent of 45 billion 
barrels of oil. 


Anticlotting agent gets 
gene-specific label change 


The US Food and Drug Administration 
(FDA) is changing the label on a widely used 
drug to include information about how two 
genes may affect dosage requirements. 

The FDA said on 16 August that the label 
on warfarin, a popular blood-thinner with 
the trade name Coumadin, is to be revised. 
The new version will state that a lower 
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starting dose “should be considered for 

patients with certain genetic variations”. 
Warfarin is widely used in patients with 

artificial heart valves and those at risk 

of blood clots. But it can cause massive 

and sometimes fatal bleeding. The two 

gene variants flagged by the FDA encode 

enzymes involved in warfarin’s breakdown 

and its function as an anticlotting agent. 
Although the FDA has demanded similar 

labelling on four other drugs, warfarin 

is the first major drug to be labelled in 

this way. “This means that personalized 

medicine is no longer an abstract concept 

but has moved into the mainstream,” says 

Larry Lesko, director of the FDA’s Office of 

Clinical Pharmacology. 


India to expand research 
institutes and universities 


The Indian Planning Commission has 
rolled out a seven-year plan to set up eight 
more Institutes of Technology — the élite 
science and technology universities. Also in 
the US$32-billion plan are 7 new national 
business schools and 20 Institutes of 
Information Technology. 

The push follows a call by Prime Minister 
Manmohan Singh to put Indian science on 
the global map by upgrading its research 
facilities. There are also plans for 30 more 
‘central universities’ — one in each of the 16 
states with no such institution at present, 
and 14 in states that offer free land. 


Notice to authors and readers 

Nature and the Nature journals have refined 

their policy on competing financial interests. 
Nature requires that all authors declare that they 
do or do not have competing financial interests 
relevant to their submission. If an article in Nature 
does not contain such a statement, readers can 
assume that authors declared to the editors that 
they have no competing financial interests. See 
www.nature.com/authors/editorial_policies/ 


competing.html for full details of the policy. 
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A change of gear 


at Siemens 


Europe's largest technology company is switching its priorities, even 
as it faces a growing bribery scandal. Quirin Schiermeier reports. 


Imost half-a-million people work for 
rat Siemens, the Munich-based electrical- 

engineering company with interests 
in everything from bullet trains to medical 
diagnostics. So when a bribery scandal hit the 
company late last year — forcing out several 
of its top managers — the stakes for German 
science and technology were high. 

Siemens has emerged over the past half-cen- 
tury as an icon of German engineering. But 
its new chief executive, Peter Léscher, is the 
first in the company’s 160-year history to be 
brought in from the outside. Markets have not 
responded particularly well to his July appoint- 
ment (see graph), and new disclosures about 
the extent of the bribery scandal are weighing 
heavily on the company. 

Léscher, a former manager with the pharma- 
ceutical company Merck, is an unknown quan- 
tity at Siemens. But the 49-year-old Austrian, 
who is described as determined and down-to- 
earth, hasn’t wasted much time on making an 
impression: just last month, the company struck 
two major deals that, analysts say, mark a consid- 
erable change in the direction of the company. 
It is getting out ofa stable, low-margin business 
— auto parts — and deeper into a dynamic, 
high-margin one — medical diagnostics. 

Siemens goal is to be “the world’s first full- 
service medical-diagnostics company’, Loscher 
said last month. Developing easy-to-handle 
‘integrated’ diagnostics systems is becoming a 
cornerstone of its research strategy, adds Rein- 
hold Achatz, the electrical engineer who directs 
the ten central research laboratories that the 
company operates around the globe. 

Achatz says that the company’s drive into 
medical diagnostics will aim to optimize hos- 
pital workflow by integrating the results of 
procedures such as computerized tomogra- 
phy with those of laboratory tests. “Advanced 
diagnostics produce an enormous amount of 
medical information,’ he says. “Translating this 
information into early diagnosis of diseases 
requires strong knowledge management and 
very efficient processing technologies.” 

Siemens says it spent some €5.7 billion 
(US$7.7 billion) on R&D in 2006 — more, 
it says, than IBM, Matsushita or, indeed, any 
rival company. Its annual sales of €87 billion 
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rank it second in the electrical and electronics 
engineering sector, behind General Electric 
(€129 billion). 

The research carried out by the 1,900 staff in 
Achatz’s labs lies at the more ‘basic end of the 
spectrum, and accounts for just 5% of the Sie- 
mens overall spending on R&D. The company 
employs a total of almost 50,000 research and 
development staff, including some 30,000 com- 
puter programmers, based in 30 countries. 

Collaboration with universities is an impor- 
tant part of the company’s research strategy, 
Achatz says. Each year, more than 1,000 projects 
are launched in conjunction with academic 
researchers all over the world. Siemens initially 
provides money, infrastructure and a business 
plan if there is possible commercial potential. 
After a year or so, the firm decides whether or 
notto go forward with the project. “That way we 
make sure that no ideas get lost,’ says Achatz. 


Home-grown success 

Germany remains the bedrock of innovation in 
the company, with almost half of the research 
staff based there. Many pioneering inventions, 
from electric locomotives in 1879 to whole- 
body magnetic resonance imaging in 2004, have 
been conceived in Siemens’ German labs. 

But with Asian markets becoming increas- 
ingly important, Siemens’ research footprint 
is going global. “Some 300 million people in 
China and India may already buy and use our 
products, but 2 billion others don't,” says Ach- 
atz. The company has hired about 300 scientists 
and engineers over the past five years at labora- 
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tories in Beijing, Shanghai and Bangalore. 

The rise of Asian technology corporations 
poses a strategic threat to Siemens, which it 
hopes to counter with its extensive know-how 
based on operations in 190 countries. “We have 
a competitive advantage in that knowledge 
from different cultures leaves its mark on our 
products,’ says Achatz. 

He adds that Siemens’ researchers haven't 
felt much impact from the bribery allegations 
that have shaken up other parts of the com- 
pany. “We are very aware that trustful partner- 
ships are of the essence when contracting out 
research,’ Achatz says. “All of our deals need to 
be legally impeccable” 

The problem first emerged last November, 
when police searched Siemens offices in Ger- 
many and Austria for evidence that corporation 
officials had bribed customers and trade-union 
negotiators. Sham businesses in several Euro- 
pean countries came to light, and around €420 
million of suspicious payments are currently 
being reviewed by lawyers and public prosecu- 
tors. But according to a report published last 
week by the newspaper Siiddeutsche Zeitung, 
internal reviewers have found evidence that 
dubious transactions amount to more than 
€1 billion — and that several divisions of the 
company are affected. 

Two board members were temporarily 
imprisoned last winter. And in April, when 
the US Securities and Exchange Commission 
announced its own investigation, chief executive 
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BIOTECHNOLOGY BLOW Amgensays it will cut between 2,200 and 2,600 jobs in 
response to falling sales of its anaemia drug, Aranesp (see Nature 447, 899; 2007). 
These will be the first-ever layoffs at the Thousand Acre, California-based company, 
whose phenomenal rise has — together with that of Genentech in San Francisco — 
provided the dominant corporate model for the biotechnology industry worldwide. 
But Amgen has come under pressure from Wall Street to cut costs: announcing a 
restructuring on 15 August, the company said that the proportion of revenue spent on 
R&D would drop from last year’s 23% to about 20%. 


WIND-ENERGY BOOM The United States is set to install more than 3,000 MW 

of wind-power capacity this year, up from 2,450 MW last year, according to figures 
compiled by the American Wind Energy Association. Total installed capacity now stands 
at about 12,600 MW: just over 1% of total US electrical generating capacity. Analysts 
say that the wind-energy market is maturing, with lenders and investors increasingly 
happy to support new projects. 


PAINKILLER PULLED A Novartis painkiller that is a rare surviving member of the COX2 
inhibitor class of drugs has been withdrawn from the market in Australia, where an 
estimated 60,000 people were taking it. Australian regulators cancelled the registration 
of the drug, Prexige (lumiracoxib), on 11 August, saying that they had received eight 
reports of serious liver toxicity, including two deaths and two people who required liver 
transplants. The Swiss company sells the drug in over 50 countries including the United 
Kingdom, and is applying for marketing approval in the United States (see Nature 448, 
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Klaus Kleinfeld and chairman Heinrich von 
Pierer were forced to step down. 

Kleinfeld — who was chosen last week to run 
the aluminium company Alcoa — had already 
sought to reduce the range of company activi- 
ties, giving up, for example, on the mobile- 
phone business. Siemens shares had doubled 
in value during his watch. 

Léscher must now complete the restruc- 
turing process while restoring the company’s 
reputation. He agreed to buy Dade Behring, 
an Illinois-based medical-diagnostics supplier, 
for $7 billion on 26 July — and to sell VDO, 
Siemens’ auto-parts unit, for $7.6 billion on 
the same day. The markets didn't like the deals 
much, believing that Siemens could have done 
better on the pricing of both. 

Siemens had already purchased German 
company Bayer’s medical-diagnostics unit for 
almost $6 billion in 2006, as well as two US 
companies in the same sector. The acquisition 
of Dade Behring makes Siemens the second- 
largest player, after Roche Diagnostics of Swit- 
zerland, ina global diagnostics market worth 
some $60 billion. 

Although the market has been tough on Sie- 
mens over the past month, most analysts accept 
that acquisitions make sense in the long term, 
and will improve the firm’s strategic position 
in medical technology. That, they believe, will 
help Siemens to maintain its place as a lynchpin 
of the German economy and one of the world’s 
most successful technology companies. a 
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The global plunge in stock markets over 
the past three weeks has taken the wind 
out of the sails of clean-energy stocks, 
ending a powerful run that saw their 
value grow by some 40% in the year to 
mid-July. 

The WilderHill New Energy Global 
Innovation Index (NEX), which tracks 
anumber of companies around the 
world with strong interests in renewable 
energy, plateaued in July and then 
tumbled this month, along with the 
general stock markets. Some observers 
say that, for clean-energy stocks, this 
was an overdue correction. 

The index has been hit hard by the 
market turmoil, says Michael Liebreich, 
chief executive of London-based New 
Energy Finance, which co-compiles the 
index with WilderShares of California. 
“It's taking a little bit of a beating,” he 
says, “but it’s still ahead on the year.” The 
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index started 2007 at 289 and hit 400 
before retreating this month (see above). 

The early summer had been particularly 
strong for US clean-energy companies, 
which had (and indeed still have) high 
hopes that the energy bill winding its 
way through Congress will provide fresh 
incentives for investment in wind and 
solar power. Rob Wilder, president of 
WilderShares, says that just before this 
month's general market slump, investors 
were betting that the bill would pass — 
and boost demand for renewable energy 
sources in the United States. 

“We expect to see further turbulence 
and volatility,” says Liebreich, who has 
been warning for some months that the 
index was due to dip (see Nature 447, 

17; 2007). “But the fundamentals of 

the industry are very strong. In a way | 
actually welcome the correction.” | 
Colin Macilwain 
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Hitting the on switch 


i) | Mi, 


1 ined stance: Robert Place (left) and 


Researchers in San Francisco have findings that suggest a whole new side to RNA interference. 
Erika Check reports on their attempts to make a revolutionary field more revolutionary still. 


hat looks perfect,” murmurs Rob- 
ert Place as he watches a smooth 
line trace across his monitor. “Tt 
never works this well” 

Place, a molecular biologist at the Veterans 
Affairs Medical Center and University of Cali- 
fornia, San Francisco, is using a spectropho- 
tometer to measure the purity of a series of 
samples. Every sample contains a tiny drop of 
micro RNA (miRNA), a type of genetic regula- 
tor that dampens gene expression — or so the 
story goes. The experiment Place is absorbed 
in is the last he must complete before he sub- 
mits a publication that could upend that story. 
He and his colleague, Long-Cheng Li of the 
University of California, San Francisco, think 
they have found some miRNAs that boost, 
rather than silence, gene expression in cells. 

Their work could shake the foundations of 
one of the hottest topics in biology — RNA 
interference — which studies how short pieces 
of RNA regulate the expression of genes. Place 
knows that his experiment will draw intense 


scrutiny from other researchers, and therefore 
it has to go perfectly. And as far as he can see, 
it has. His spectrophotometer displays a series 
of flawless curves, free of impurities; it looks as 
if he and his colleagues are finally seeing the 
pay-off after a three-year saga of frustration and 
exhilaration. 

Since about the turn of this century, scien- 
tists have realized that 50 years 
of focus on DNA had blinded 
them to the wide range of bio- 
logical roles held by its chemi- 
cal cousin, RNA. The old view 
was that DNA contained life’s 
instructions, proteins carried 
them out, and RNA served as 
little more than a go-between. 
It's now become clear that RNA 
has vast potential for controlling 
how cells interpret the instructions embedded 
in the genome. 

The RNA revolution began in 1998 with the 
discovery honoured by last year’s Nobel Prize 
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in Physiology or Medicine that small strands 
of RNA stuck together in pairs, like the strands 
that make up DNA, could turn off specific genes 
in roundworms'. RNA interference was born. 
In 2001, scientists discovered that the proc- 
ess works in mammals, too’. They found that 
interference could be triggered by ‘short inter- 
fering RNAs (siRNAs) or ‘short hairpin RNAS 
(shRNAs). Both work by using 
a pair of scissors, actually a 
complex of proteins known as 
RISC, to cut apart the longer 
messenger RNAs (mRNAs) 
that take information from 
the genes to the cell’s protein- 
making machinery. The small 
RNAs target mRNAs because 
their sequences match, and by 
destroying the messenger, RNA 
interference stops genes from making proteins. 
Both siRNAs and shRNAs are being tested in 
clinical trials against conditions in which an 
overactive gene needs to be shut down’. 
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Scientists then discovered miRNAs, which 
also trigger the interference pathway, but 
unlike siRNAs and shRNAs are naturally 
encoded by cells’ DNA. So far, more than 500 
miRNAs have been identified in the human 
genome’. They can act like siRNAs, using the 
same protein complexes that slice through 
messenger RNA. But for the most part, 
miRNAs are content to muzzle the message 
or mark it for degradation, rather than chop it 
up (see graphic). 

All these techniques were still quite new 
back in 2004, when Li began to investigate 
RNA at the San Francisco Veterans Affairs 
Medical Center in the laboratory of Rajvir 
Dahiya, a urologist. Li, also a urologist, was 
studying epigenetics — stable modifications to 
the genome that change howit is read without 
altering its sequence. 

He was particularly interested in DNA meth- 
ylation, the addition of chemical tags called 
methyl groups to regions of DNA. Methyl tags 


BEYOND INTERFERENCE 


often silence nearby genes, which can be disas- 
trous for genes that usually suppress tumours. 
So Li was trying to find ways to reverse this 
process. He decided to try an RNA-based 
technique to control methylation that hadn't 
yet been used in animals, as far as he knew, 
although it had been demonstrated in plants 
a decade earlier’. 


Unexpected boost 

Li purchased two pieces of double-stranded 
RNA that were complementary to a DNA 
control sequence — known as a promoter 
— upstream of the gene that encodes the 
tumour-suppressor protein E-cadherin. Then, 
his colleague Hong Zhao added the double- 
stranded RNA to two lines of prostate cancer 
cells and measured how it affected E-cadherin 
expression. Three days later, Zhao noticed 
something shocking: the experiment had 
boosted levels of the E-cadherin protein by 4- 
to 14-fold. “I couldn't believe it? Li says. 


RNA interference (RNAi) has been one of the hottest fields of research in the past decade. As more details are 
clarified about the various triggers — the short RNAs that do the dirty work — more questions arise such as how RNA 
interference works in the nucleus to stop transcription (or possibly amplify transcription). 
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The work defied everything that scientists 
had ever reported about small, double-stranded 
RNAs. They were only supposed to dampen 
gene expression. Li seemed to have stumbled 
on a phenomenon that could rewrite the text- 
book understanding of RNA interference and 
might offer new therapeutic potential. 

Li re-ran the experiment many times, each 
time with the same result. He found that two 
other cancer-related genes, VEGF and p21, could 
also be activated by double-stranded RNAs. The 
evidence seemed solid. At least in the prostate 
cancer cells he was working with, short, double- 
stranded RNAs activated genes. “It was so easy to 
observe — I didn’t understand how people could 
have ignored it for such a long time;’ Li says. 

But apparently they had. So Li knew he 
was going to have to put together a watertight 
case to convince other scientists. He started 
to realize how difficult that would be when 
he first submitted his work for publication 
to Science in August 2004. It was promptly 
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rejected. He then submitted his paper to 
Nature that December; when it was rejected, 
he resubmitted it with new data in April 
2005. He presented his findings at the annual 
conference of the American Association for 
Cancer Research in Anaheim, California, in 
May 2005, and the reception wasn't warm. “I 
got alot of sceptical questions,’ Li says. Then, 
after an extensive delay, Nature again rejected 
Li’s paper in December, 2005. Without evi- 
dence for a mechanism, he says he was told 
the results weren't convincing enough. 

Unsure how to proceed, Li got some unex- 
pected help in the form of Place, who joined 
the lab in October 2005. Fresh from earning 
his PhD, Place was excited by the novelty of 
Lis findings. But he could tell that the lab had 
a lot of nitty-gritty molecular biology work 
to do. Place helped plan experiments to help 
convince the sceptics. For instance, he helped 
Liand his technician Deepa Pookot to perfect 
assays, called immunoblots, that detect pro- 
teins so that they could measure how RNA 
activation boosts levels of E-cadherin and 
other proteins. 


Filling in the details 

Place and Li also started to think about the 
mechanism behind RNA-directed gene acti- 
vation, a major stumbling block for the edi- 
tors and reviewers who had seen the paper. If 
it worked in the same way as gene silencing, 
the sequence of the trigger RNA should mat- 
ter, and the pair found that it did. Changing 
five letters at one end of the 21-letter sequences 
rendered them inactive. 

The pair also found that activation used 
some of the same key proteins 
that are involved in silencing, 
such as Dicer, which cuts up 
strands of RNA so that they can 
be used by RISC to target mRNA. 
They began to experiment with 
activation, trying to find out 
what worked best. “We started 
to modify the RNA duplexes to 
get optimal activation, tweaking 
their chemical structures,’ Place 
recalls. “There came a point 
when we were working in syn- 
chronization, and we really started to click” 

Li resubmitted the work to Science with 
Place’s additional molecular biology results, 
but it was again rejected. The letter he received 
said that because the work “would represent a 
substantial paradigm shift’, the evidence just 
wasn't strong enough. Again, editors required 
demonstration of a mechanism. 

So, in June 2006, Li talked to leaders of the 
RNA- regulation field at a meeting in Cold 
Spring Harbor Laboratory, New York. Li recalls 


“Last time | talked 
about activation ata 
meeting, a couple of 
leaders in the field 


jumped all over 
me and said it 
wasn't true.” 

— David Corey 


TEPETTTTTTTTT 


é 


VII 


© 4 


asking David Bartel 
of the Whitehead Institute for Biomedical 
Research in Cambridge, Massachusetts, if he 
thought RNA activation — not just inhibition 
— was possible. Bartel said he didn't think so, 
Li says. Bartel says he doesn’t remember the 
conversation, “I was probably just trying to 
find out how solid the evidence for activation 
really was,’ he explains. Li was after all a junior 
researcher in an unrelated field, and he hadn't 
been able to publish his findings. 

Exasperated, Li considered sending his find- 
ings to an online journal with less stringent cri- 
teria than Nature or Science. But Dahiya thought 
that the work would get buried in obscurity, and 
convinced Lito try the Proceedings of the National 
Academy of Sciences instead. In August 2006, the 
lab submitted its work there 
— and finally got its break- 
through. The journal reviewed 
it in weeks and published the 
work online in November’. 
Two months later, vindication: 
another lab, led by biochemist 
David Corey of the University 
of Texas Southwestern Medical 
Centre in Dallas, published a 
paper essentially duplicating 
the results in Nature Chemical 
Biology’. “It was really good for 
us,’ Place says. “Before that, it seemed like every- 
one just thought we were crazy. 

Back at the Veterans Affairs Medical Center, 
Place and Li were now working closely together, 
although Li was preparing to move to his own 
lab at the University of Califorinia, San Fran- 
cisco. They had already devised a series of 
experiments that they hoped would bolster their 
activation hypothesis. Although some changes 
to the sequence can render RNA useless, oth- 
ers fine-tune the activation. The trigger doesn't 
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have to 
match the target 

sequence exactly, and a few tweaks 

to the double-stranded RNA produces less acti- 

vation. And that’s exactly how miRNA works. 

So the pair started hunting for a miRNA that 

might activate E-cadherin. It wouldn't prove 

that activation was a natural phenomenon, but 

it would strengthen the case, because miRNAs 

are encoded in the genome. 

Place and Li used a bioinformatics tool to 
search for miRNA in the human genome 
that had sequences roughly complementary 
to the E-cadherin promoter sequence. They 
found a handful of candidates and transferred 
them into the prostate cancer cell lines. The 
experiment was so simple they half-expected 
it wouldn't work — but it did. The pair found 
one miRNA, miR-373, that boosted E-cadherin 
expression. “When that worked, we were totally 
pumped, because it was a potential example of 
natural function,’ Place says. 


On the right track 

But it still wasn’t proof. So Place designed an 
miRNA precursor that, according to the model 
of miRNA biogenesis, should work as well as 
the miRNA itself. It did. Place also found that 
ifhe knocked out Dicer, then miR-373 stopped 
working, and that Dicer could also activate 
another protein with a promoter sequence 
similar to E-cadherin’. All these experiments 
supported the idea that miRNA could use the 
interference pathway to activate genes. It was a 
strong hint that Li and Place were on the right 
track. On 6 August, they submitted the work 
to Nature Chemical Biology. 

The question now is how this work will be 
received. Many are still sceptical about activa- 
tion, as David Corey can attest. “Last time I 
talked about this at a meeting, a couple of lead- 
ers in the field jumped all over me and told 
me it wasnt true,” Corey says. “We've received 
grant and manuscript reviews that seem to 
express irritation more than anything else. I 
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just wish they would look closely at our data.” 
Indeed, it sometimes seems as though the 

idea of activation is struggling because it con- 

tradicts the interference dogma. The RNA 

interference field is quite young, but already 

seems to have acquired a certain amount of 

inflexibility. On scanning through archives of 

biology message forums, Place has found that 

other scientists — often graduate students — 

have also seen evidence of RNA activation. But 

they have been encouraged to discount it. 
Phillip Sharp, a Nobel-prize-winning biolo- 

gist whose lab at the Massachusetts Institute 

of Technology in Cambridge 

pioneered much of the work 

on RNA interference, admits 

that RNA-mediated activa- 

tion might be possible, but 

says that “the results I have 

seen do not prove this”. Echo- 

ing the demands of manus- 

cipt editors, he says, “I think 

further papers on the subject 

must address the mechanism 

if they are to be published in a high-profile 

journal” 


A new pathway 

As the field has moved apace, such requests are 
warranted, and providing proof of a mecha- 
nism is Li and Place’s main challenge. Some 
have suggested that activation is simply inhi- 
bition in disguise. It could be the accidental 
result of silencing an upstream repressor or 
of blocking another silencing RNA. Although 
they can’t rule out these possibilities, from 
their experiments, they say that these mecha- 
nisms look unlikely. For instance, they can 
elicit activation predictably at specific genes 
by targeting their promoters. 
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In this together: Long-Cheng Li (left) is briefly reunited with the team from the Veterans Affairs 
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Medical Center. Left to right, Li, Emily Noonan, Robert Place, Deepa Pookot and Rajvir Dahiya. 


There are intriguing differences between 
the known silencing pathways and the 
observed activation phenomenon. Silencing 
is triggered within hours and ceases in about 
seven days, whereas activation takes days to 
appear but can last for weeks. The different 
kinetics suggest that some mystery process is 
involved, Place says. “People say this could be 
RNA interference with another name, but it’s 

so blatantly different.” 

Major questions also remain 
about how RNA regulation 
might act at gene-promoter 
regions. In the classic inter- 
ference pathway, the RISC 
complex guides siRNAs or 
miRNAs to a target mRNA in 
the cell’s cytoplasm. But to reg- 
ulate a promoter, a small RNA 
would have to sneak into the 

cell nucleus, where DNA is transcribed. There 
is mounting evidence that this happens in 
silencing pathways, but again the mechanism 
is unclear. 

In 2004, two groups published papers that 
suggested that siRNAs that target gene-pro- 
moter regions can silence genes if they are 
delivered into cell nuclei. Although one group 
has since retracted its paper, the other group, 
who published in Science®, showed that the 
inhibition was accompanied by epigenetic 
marks associated with silencing. Kevin Mor- 
ris, of the Scripps Research Institute in La Jolla, 
California, an author of that paper, has contin- 
ued to study how this occurs, and sympathizes 
with Li and Place’s position. “I was there in 
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2004, when 50% of the people love your work 
and the other 50% think you're full of it?” Mor- 
ris says. “It’s a frustrating place to be” 

Frustrating, indeed. As the deadline for this 
news feature approached, Place received word 
that the group’s manuscript on miRNA activa- 
tion had been rejected. The reason given: with- 
out proof of a mechanism, the evidence isn't 
substantial enough. Li is still adamant that the 
field will come around. “We have no doubt that 
RNA activation is an endogenous mechanism,” 
he says. And Place seemed unsurprised. “We 
knew the mechanism would be the sticking 
point. That’s the hardest part to prove.” But, he 
predicted, if the group can just get someone 
to just take a look at its data, the strength of 
its evidence will prevail. “If we can get it into 
review, we'll be okay,’ Place says. 

When RNA interference first hit the scien- 
tific radar, it was a slow climb from something 
written off by many as artefact to a revolu- 
tionary paradigm. That same uphill battle 
confronts Place, Li and their collaborators as 
they try to rewrite, or at least refine, the revo- 
lution. Call it what you will — stubbornness, 
confidence or optimism — this group just isn’t 
going to give up. 

Erika Check writes for Nature from San 
Francisco. 
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C. PICKENS 


ROOM 


MAKING 


Elephant populations are soaring in some parts 


of Africa. Emma Marris discovers there's no 
single way to fit them in amid the people. 


he car creeps towards the left as the 

driver tries to get as close as possible 

to the group of elephants foraging at 

the road’s edge. The elephants walk 
silently, communicating in companionable 
purrs. The loudest noise is the crackle of the 
tough foliage they are eating. 

“You're crowding Agatha and she’s going 
to have to press into the bush to get by,” says 
Katie Gough from the back seat. Gough, based 
at Nelson Mandela Metropolitan University 
(NMMVU) in Port Elizabeth, South Africa, has 
been studying these animals for four years, and 
can tell most of the elephants apart by the wear 
and tear on their ears or by idiosyncratic wrin- 
Kles. She is right about Agatha, who, slowly 
moving her enormous body into the thorny 
shrubs, turns her head and gives the occupants 
of the car a look that everyone reads — scien- 
tific prohibitions on anthropomorphization 
be damned — as reproachful. 

Agatha’s home, Addo Elephant National 
Park in South Africa, has too many elephants. 
In 1954 there were 22 animals in a park of 
about 2,300 hectares. They were the remnants 
of a herd hunted nearly to extinction by one 
hired hunter in 1919. He was carrying out 
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the orders of local orange growers who were 
sick of elephants gorging themselves on their 
crops. Today, there are around 460 animals 
in about 26,000 hectares — or roughly dou- 
ble the 1954 density. Throughout its history, 
densities have waxed and waned; in the main 
camp of Addo they now stand at about three 
elephants per square kilometre. One estimate 
of the maximum number of elephants this area 
can sustainably support is about 0.5 elephants 
per square kilometre’. 

“Twenty or thirty years ago, things were 
black and white,” says Graham Kerley, the ele- 
phant expert at NMMU who drove too close 
to Agatha. “You had elephant management 
and you had laissez-faire.” Although faced by 
an excess of elephants, Kerley is clearly still 
charmed by them, as he is by all the carefully 
managed animals at Addo — among them 
the elegant kudu, zebras with tawny rumps, 
and the immense ostriches sprinting along 
the road. 

These days no one believes in laissez-faire. 
The effects that too many elephants have on 
their environment are easy to see. Climbing 
the stairs over a fence to cross from an ele- 
phant area into a pachyderm-free zone, the 
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landscape switches from patchy shrublands to 
a lilliputian forest containing a wide variety of 
plants, including spectacular aloes. 


Giant appetite 
In places where the Addo elephants have 
already consumed the juicier vegetation, 
they now feed on bushes such as spekboom, 
sweet thorn and the bee-sting bush — plants 
as tough and spiny as their names suggest. In 
the Addo area, elephants may threaten some 
168 species of plants with extinction’. These 
include the geophytes and the aptly named 
succulents, including the jade plant. And 
as the miniature forest turns into miniature 
savannah, other animals will see their habi- 
tats change. Animals that live in areas dense 
with vegetation, such as the cape grysbok and 
bushpig, will have their habitats fragmented. 
As the Addo experience shows, if your goal is 
to preserve an entire ecosystem and not just the 
elephants within it, pachyderm numbers have 
to be controlled. But where park managers, sci- 
entists and government experts disagree on the 
best means of managing elephant populations, 
the elephants can find themselves left in limbo. 
For an outsider, it’s a challenge to switch 
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from seeing African elephants as much of the 
Western world usually does — as majestic, rare 
animals dying at the hands of villainous ivory 
poachers — to a more nuanced, more African 
conception. Depending on who and where you 
are in Africa, elephants may also be livelihood- 
destroying crop-raiders, expensive tourist 
lures, environmental menaces, or dinner. 


Different countries, different stories 
So it comes as no surprise that Africans can 
become irritated when told how to manage 
their elephant populations by interested par- 
ties abroad. Well-meaning and 
sometimes technically expert 
outsiders are constantly offer- 
ing opinions on whether to cull 
animals, whether to try contra- 
ception, whether some kinds 
of ivory should be legally sold, 
and so on. Often, onlookers will 
treat the whole continent as a 
single case, when each country 
and each area has its own problems to solve. 

Most African elephants still live outside 
protected areas, and so they increasingly come 
into fraught contact with Africa’s expand- 
ing population. Botswana has an estimated 
150,000 elephants roaming without major 
barriers between them and pastoralists herd- 
ing livestock. In Namibia, elephants range free 
of people in the western deserts, but they butt 
heads — sometimes literally — with agricul- 
turalists elsewhere in the country. In other 
countries, elephants have been fenced into 
reserves or hemmed into protected areas by 
development as more land is built on. 

Across southern Africa as a whole, elephant 
populations are increasing by about 4% annu- 
ally, while herds in central and western Africa 
continue to struggle with poaching and habi- 
tat fragmentation’. “I could tell you a different 
story for every African elephant range,” says 
Holly Dublin, an elephant expert who is chair 
of the Species Survival Commission at the 
World Conservation Union. 

Nowhere do humans and elephants pack 
in more tightly than in South Africa. So it’s 
tempting to see the country as an example of 
how elephant management might look across 
Africa in the future. But the country is different 
to others in that there are fewer than half-a- 
dozen unfenced elephants nationwide. These 
are the elephants of the Knysna forest, which 
have held on to their freedom only by virtue 
of living in an inaccessible valley. All the other 
elephants are descendants of remnant herds, 
living behind strong fences built with materials 
such as railway ties and mineshaft cabling. 

“South Africa cannot be considered indicative 
of the challenges facing elephant conservation 


“Opposition to 
lethal control is 


usually favoured by 
affluent people.” 
— Hector Mogame 


in the rest of Africa,’ says Dublin. “It is more 
like living in America and trying to take care 
of elephants. For the most part, rural commu- 
nities are not trying to make their livelihoods 
right next door.” 

South Africa itself has only a small fraction 
of the continent’s elephants. But to judge from 
media reports, they might as well have almost 
all of them. Coverage of the culling contro- 
versy is largely responsible for this. Until 
1994, managers at South African National 
Parks (SANParks) would shoot elephants 
in places such as Kruger National Park, far 
north of Addo on the Mozam- 
bique border, whenever their 
numbers got too high. The 
slaughtered elephants would 
be given to local people as a 
free meal. Other elephants 
were transported to parks that 
wanted more elephants or sold 
to private reserves. 

Before the modern practice 
of culling whole family groups was introduced, 
culling occasionally resulted in orphans that 
grew up to behave antisocially. For example, 
some orphan elephants, which were moved 
from Kruger to other parks, famously became 
maladjusted juveniles that went around killing 
rhinoceroses’. 


Getting ina flap 
In 1994, culling was stopped because of the 
objections of local and foreign animal rights 
groups. Elephant densities have since increased, 
and in Kruger, where there are now about 0.63 
elephants per square kilometre’, large bulls 
flex their muscles by pushing over baobab and 
marula trees. And after a decade of debate, 
in 2004, SANParks announced that a whole 
suite of tools — including culling — should 
be considered for managing elephant numbers. 
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This news was not received well, especially by 
wildlife conservation groups, who claimed that 
culling was inhumane, even unethical. 

Hector Mogame, executive director of Con- 
servation Services at SANParks, Pretoria, 
doesn't see it this way. “The issue of ethics is 
about power,’ he told the Society for Conserva- 
tion Biology in Port Elizabeth this July. “The 
viewpoint of opposition to lethal control is 
usually favoured by affluent people — people 
with money.’ Citing the furore over SANParks’ 
plan to resume culling, he says the debate is 
about the people on the ground being trumped 
by the rich and powerful. 

SANParks officials have historically had a 
relatively free rein to manage the animals within 
their park boundaries. But after the 2004 outcry, 
South African government ministers stepped in 
and asked SANParks to provide scientific justi- 
fication for its culling plan. In 2005, Marthinus 
van Schalkwyk, minister of tourism and the 
environment — and these two are very closely 
linked in South Africa — called for a scientific 
round table representing different views on the 
elephant management debate. Mogame and 
Kerley were two of those invited. 

In January 2006, the Elephant Science 
Round Table decided that Kruger didn’t have 
enough data at that moment to show culling 
was necessary, but smaller reserves experi- 
encing “bigger pressures” might need to do 
something immediate. The group has contin- 
ued to meet unofficially to advocate for a major 
research push on elephant management. 

SANParks has moved away from suggestions 
that there is an ideal number of elephants. “The 
maintenance of biodiversity is best achieved by 
permitting — or if 
appropriate 
actually 
encour- 


aging 


Changing diet: 
elephants move 
on to sweet thorn 
(above) once 

the jucier aloes 
(left) have been 
consumed. 
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— variation in time 
and space, rather than 
attempting to manage 
for stability,’ says Wanda 
Mkutshulwa, head of 
communications at SAN- 
Parks. As of now, the official 
management strategy of SAN- 
Parks is “wait and study”. 

In the meantime, the 


elephants are continuing Holly Dublin believes all the options 
for managing elephant populations 
need to be available. 


to restructure their habi- 
tat. Their movements are 


expect to see at least 
one or two massive grey 
beasts having a drink. 
But there are at least 
two problems with 
removing the water- holes. 
At Addo, water -holes had 
to be put in because the river, 
which would have been the 
water source for elephants 
in the area, is not part of 
the park. Without the water 
provided by SANParks, the 


dictated by the presence of 

food and, crucially, water. Around any surface 
water where elephants drink, you will find a 
‘piosphere;, a circle of reduced vegetation cover 
demonstrating the landscape-changing power 
of these animals. The shrubs remaining have 
just a few leaves, and seem as if they are cling- 
ing to life. There is no grass, and warthogs are 
pale from dust bathing. The word ‘desertifica- 
tion’ springs to mind. 

Adrian Shrader of NMMU argues that 
if you pack water-holes and elephants in so 
densely that the piospheres begin to overlap, 
landscapes may begin to suffer irreversible 
losses. But even without such pressures, a high 
number of elephants will over time reduce 
plant diversity. 


Which path to tread? 

For now, park managers still have several other 
options besides culling that they can actively 
pursue. But which is best? Most elephant 
experts have a pet intervention they study or 
favour. Round-table member Bruce Page, at 
the University of KwaZulu-Natal, Durban, is 
keen on contraception, which has promise but 
is extremely awkward in execution. It requires 
identifying elephants, and then shooting hor- 
mones into them from a truck or helicopter. 
Field trials at Kruger recorded reasonable suc- 
cess rates. 

In Page’ population models, contraception 
and culling affect population structure in dif- 
ferent ways. The proportion of the population 
under ten years of age increases under culling, 
but decreases under contraception. When pop- 
ulation control ceases, you get a baby boom 
after culling that you dont get after the cessa- 
tion of contraception. Still, Page estimates that 
under operating conditions, the best one can 
hope for is to prevent about 75% of the females 
from giving birth. 

Other experts call for the removal of artifi- 
cial water sources, which are sometimes used 
to lure the animals to within sight of tourists. 
In Addo, one can have lunch at the park res- 
taurant and then stroll across the car park to 
a viewing area overlooking a water-hole, and 
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Addo elephants would per- 
ish along with all the other mammals. Another 
problem is that when you use water scarcity 
to limit elephant numbers, they die of thirst. 
This would be an even worse public-relations 
disaster than shooting them, 
according to Shrader. “It would 
hit the evening news: skeletal 
elephants and their babies.” 

More controversially, Rudi 
van Aarde of the University of 
Pretoria advocates knocking 
down all the fences you can 
and allowing natural metap- 
opulation dynamics to manage 
elephant numbers. The idea 
is to get as close to ‘natural’ as possible. “It is 
when you put up hard fences and dig the land 
full of water-holes, that we are creating prob- 
lems that we shouldn't try to solve through the 
barrel of the gun or through contraception,” 
says van Aarde. 

Critics note that many people would be 
angry, not to say frightened, if the fences 
between their village and a herd of elephants 
were removed. Van Aarde insists that enough 
unpopulated land could be cobbled together to 
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“| suspect that 
inthe end we 
will come up 


with a uniquely 
African solution.” 
— Graham Kerley 


avoid this, and that if elephants caused prob- 
lems at the extremes of their range then they 
could be shot by villagers. But this would put 
the onus of management on local people with a 
lot to lose. “Maybe with time this will prove too 
extreme to be applicable,” he admits. 

The argument over elephant ranges ulti- 
mately depends on how you draw the maps. 
Many people feel that the areas currently occu- 
pied by elephants in Africa are as large as can 
be reasonably expected. With human numbers 
growing as they are, there is no space for the 
elephants to expand into. 

And even when range expansion has been 
tried, there's scant evidence that taking down 
the fences encourages elephants to populate a 
new area. In many cases, they like it where they 
are. If they can get enough water, why move? 
The removal of a fence on one 
side of Kruger to create a park 
that crosses into Mozambique 
has reportedly boosted the tran- 
sit of people in stolen cars much 
more than it has increased the 
movement of elephants. 


Packing their trunks 

There are also proposals to 
move elephants from areas 
where they are living in high densities to areas 
where numbers are fewer, but according to 
Dublin, interest in that idea has “cooled down” 
in recent years. Moving an elephant is — no 
surprise here — an enormous undertaking. 
One must knock them out and then heft them 
around in trucks chaperoned by experienced 
personnel. It is very expensive. Another con- 
cern, according to Dublin, is the destination. 
For example, ifit is sparsely populated because 
poachers are killing elephants there, why move 


Effects of drinking: elephants can dramatically alter the landscape around their water sources. 
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Pushing for research: Graham Kerley works with officials at Addo Elephant National Park, which 


contains more elephants per square kilometre than some experts estimate it can sustainably support. 


the beasts into a death trap? Some non-gov- 
ernmental groups such as the Massachusetts- 
based International Fund for Animal Welfare 
and the South Africa-based Peace Parks Foun- 
dation have funded relocations, but usually on 
the scale of tens of animals. The Kenya Wild- 
life Service, which manages elephants in that 
country’s protected areas, relies heavily on 
relocations, as it does not cull. 

Ask any elephant expert about the various 
management options, and they will usually 
say they may need to use all or some, depend- 
ing on the situation. “You cannot expect that 
one size fits all?” says Dublin. “If you take away 
any of the tools, that is not the best thing for 
elephants or for Africa” 

Above all, elephant managers — be they 
governments, park managers, or consult- 
ing scientists — have to decide the reasons 
for managing elephant numbers. If it is to 
achieve maximum biodiversity, one strategy 
and population density of elephants might be 
best. If it is for iconic savannah landscapes, or 
to increase the genetic diversity of elephant 
populations, other routes may be needed. As 
Kerley says, “if you just want an elephant view- 
ing park, you might as well just tear all this out 
and plant alfalfa”. 

Part of the challenge is that we have only 
vague ideas of elephant numbers and move- 
ments in Africa prior to Europeans showing 
up with flintlock elephant guns and sailing 
away with holds full of ivory. Some elephants 
migrate. Females and their offspring move 
about together; bulls often live alone. It is no 
easy task to establish a baseline — whether 
for elephant numbers or behaviour, or for the 
landscapes in which they lived. 

Even the parts of Addo park that are 


untouched by elephants are not a good guide 
to what the forests looked like before Euro- 
pean colonization, because there would have 
been some elephants around. “What we don't 
have information on,’ says Kerley, “is what 
the landscape should look like and how many 
elephants will achieve that.” 

Some argue that culling, although the most 
controversial intervention, may also be the 
most ‘natural’. Those who study elephants say 
that they do not change their reproductive 
behaviour much in the face of food 
scarcity — although van Aarde 
and others maintain that ele- 
phants do this in savannah 
ecosystems. Other herbiv- 
ores will have their first 
offspring later in life 
and have subsequent 
offspring farther apart 
in lean times. Not so 
the exuberant elephant, 
which, given enough 
foliage to start with, will 
reproduce and consume 
its way right into a nasty 
population crash. So perhaps 
something else was limiting their 
numbers in ages gone by. 


Mammoth story 

We do know that before the Europeans arrived, 
Africans hunted elephants in pit traps, or, 
in what must have been a pretty spectacular 
manoeuvre, running up and slashing them in 
the Achilles tendon. David Cumming of the 
Percy FitzPatrick Institute of African Ornithol- 
ogy at the University of Cape Town speculates 
that human predation has been the key limit 
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on the population of elephant-like animals 
— proboscideans — in Africa for more than 
a million years. He is of the camp that favours 
predation over climate change as an explana- 
tion for the extinction of earlier proboscidean 
species. 

The fossil record confirms that the number 
of proboscidean species dropped from nine to 
two after humans appeared on the evolution- 
ary scene. Cumming mentions, too, the many 
large mammals that humans almost certainly 
did kill to extinction. So can South Africa’s 
elephants prosper without culling? Is it pos- 
sible that by protecting elephants in special 
areas, we are removing what once limited their 
numbers? Animal rights activists and tourists 
don’t like hunting or culling. But it might be 
the simplest and most direct way to reduce 
elephant numbers. 

Not all experts agree that pre-gun human 
predators were capable of taking down enough 
elephants to directly regulate the population. 
But even if it’s not the ‘natural order’, it might 
still have a place in management. 

And as Kerley explains, hunting isn’t as sim- 
ple as reducing the total number. “It’s not how 
many elephants you kill, but how you influ- 
ence resource use through fear. Elephants 
are incredibly risk averse.” So by killing a few 
elephants near a certain village or farm or 
water-hole, you may be able to keep the rest 
of the herd, especially the more timid females 
and calves, from using that area. 

Kerley argues for exploring all 
options: “I suspect that in the 
end we will come up with a 
uniquely African solution. 
We might reinstate some 
level of predation — call 
it predation rather than 
culling — but we will 
also make more space 
for elephants.” And the 
inescapable conclusion 
from all across Africa is 
that elephants do have to 
be actively managed. It is 
interesting to look into their 
large dark eyes — famous for 
seeming wise, even world-weary 
— and wonder if they have any inkling that 
they are no longer in charge. 
Emma Marris is a reporter for Nature based in 
Columbia, Missouri. 
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Scientists should unite 
against threat from religion 


SIR — It was genuinely alarming to 
encounter Ziauddin Sardar’s whitewash of 
Islam in the pages of your journal (“Beyond 
the troubled relationship’ Nature 448, 
131-133; 2007). Here, as elsewhere, Nature’s 
coverage of religion has been unfailingly 
tactful — to the point of obscurantism. 

In his Commentary, Sardar seems to 
accept, at face value, the claim that Islam 
constitutes an “intrinsically rational world 
view’. Perhaps there are occasions where 
public intellectuals must proclaim the 
teachings of Islam to be perfectly in harmony 
with scientific naturalism. But let us not 
do so, just yet, in the world’s foremost 
scientific journal. 

Under the basic teachings of Islam, the 
Koran cannot be challenged or contradicted, 
being the perfect word of the creator of the 
Universe. To speak of the compatibility 
of science and Islam in 2007 is rather like 
speaking of the compatibility of science and 
Christianity in the year 1633, just as Galileo 
was being forced, under threat of death, to 
recant his understanding of the Earth's 
motion. 

An Editorial announcing the publication 
of Francis Collins’s book, The Language of 
God (‘Building bridges’ Nature 442, 110; 
2006) represents another instance of 
high-minded squeamishness in addressing 
the incompatibility of faith and reason. 
Nature praises Collins, a devout Christian, 
for engaging “with people of faith to explore 
how science — both in its mode of thought 
and its results — is consistent with their 
religious beliefs”. 

But here is Collins on how he, as a scientist, 
finally became convinced of the divinity of 
Jesus Christ: “On a beautiful fall day, as I 
was hiking in the Cascade Mountains... 
the majesty and beauty of God's creation 
overwhelmed my resistance. As I rounded a 
corner and saw a beautiful and unexpected 
frozen waterfall, hundreds of feet high, I 
knew the search was over. The next morning, 
I knelt in the dewy grass as the sun rose and 
surrendered to Jesus Christ.” 

What does the “mode of thought” 
displayed by Collins have in common with 
science? The Language of God should have 
sparked gasping outrage from the editors at 
Nature. Instead, they deemed Collins’s efforts 
“moving” and “laudable”, commending him 
for building a “bridge across the social and 
intellectual divide that exists between most of 
US academia and the so-called heartlands” 

Ata time when Muslim doctors and 
engineers stand accused of attempting 
atrocities in the expectation of supernatural 
reward, when the Catholic Church still 
preaches the sinfulness of condom use in 
villages devastated by AIDS, when the 
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president of the United States repeatedly 
vetoes the most promising medical research 
for religious reasons, much depends on the 
scientific community presenting a united 
front against the forces of unreason. 

There are bridges and there are gangplanks, 
and it is the business of journals such as 
Nature to know the difference. 

Sam Harris 
Address withheld by request 
www.samharris.org 


Religion: Islamic science 
fading before colonialism 


SIR — In his Commentary on the rise and 
fall of Islamic science (Nature 448, 131-133; 
2007), Ziauddin Sardar’s discussion of how 
modern Muslim societies could be improved 
by scientific endeavours is encouraging. 
However, he places most of the blame for 
the decline of Islamic science on Western 
colonialism. This is historically inaccurate. 
Like the tip of an iceberg, colonialism is not 
the major factor — just the most visible. 

It is true that colonialism “displaced 
meaningful cultural activities from Muslim 
society,’ and caused “the general economic 


and political deterioration of Muslim society” 


that led to the ultimate collapse of Islamic 
science. But it’s generally accepted that 

the golden age of Islamic science stretched 
from about 800-1400 ap, and its decline 
started more than a century before 
Western colonialism began in the late 
fifteenth century. 

A number of devastating blows to Muslim 
society have been implicated in the decline 
of Islamic science: military invasions, 
massacres and infrastructure destruction; 

a long period of drought beginning around 
1250 ap; and a series of plague epidemics 
between 1347 and 1515. These destructive 
developments overlap with the main period 
of decline of Islamic science in the fourteenth 
and fifteenth centuries. The onset of 
European colonialism from the late fifteenth 
century onward merely completed a process 
that had begun long before. 

Todd P. Silverstein 

Department of Chemistry, Willamette University, 
Salem, Oregon 97302, USA 


Berlin shows how natural 
history can pull the crowds 


SIR — You emphasized the roles that 
museums should play in both increasing 
and diffusing knowledge, in your Editorial 
“Museums need two cultures’ (Nature 446, 
583; 2007). Germany's premier natural- 
history institution, the Berlin museum of 
natural history, recently demonstrated the 
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potential that museums have to do the latter. 

On 13 July, its main palaeontology 
exhibition reopened after a redesign. The 
dinosaurs in the main hall are now all 
mounted according to the latest findings 
in vertebrate palaeontology. The tallest 
mounted dinosaur skeleton in the world — 
of Brachiosaurus brancai — is no longer 
sprawling but striding majestically through 
the hall. On the opening day, thousands of 
visitors queued from the underground 
station, breaking Germany’s record for the 
highest number of visitors to an exhibition 
during an opening weekend. 

The Berlin Brachiosaurus skeleton was first 
mounted in 1937. Seventy years, although 
little by dinosaur standards, is along time for 
scientists and the public to wait before money 
is provided for a modern exhibition. Funding 
bodies must recognize that natural-history 
museums are in a unique position to explain 
complex issues in science to a wide audience. 
But to do so, they need to be able to update 
their exhibits and present new research to the 
public on a regular basis. 

The results are well worth the investment. 
During the opening weekend, more than 
38,000 visitors preferred the Jurassic period 
to a day in the Berlin sunshine. 

Nizar Ibrahim 

School of Biology and Environmental Science, 
University College Dublin, Belfield, 

Dublin 4, Ireland 


Puns can be baffling, so 
keep headlines simple 


SIR — I beg to differ with Renée M. Ned 
and Lisa N. Steele's Correspondence ‘Slang’s 
not so slick when you remember its origins’ 
(Nature 447, 775; 2007) about the use of 
the word ‘pimp’ in a News Feature headline 
(‘Pimp my antibody’ Nature 446, 

964-966; 2007). 

The word first appeared in sixteenth- 
century France as the verb pimper, meaning 
‘to dress elegantly; and as the adjective 
pimpant, ‘alluring in dress, seductive. In 
the seventeenth century, the word was 
associated with ‘a knave, rascal, varlet, 
scoundrel; according to the Online 
Etymology Dictionary (www.etymonline. 
com). The vulgar modern meaning probably 
derives from a combination of these. The 
sense in which it is used by the television 
show Pimp My Ride could imply dressing an 
automobile elegantly — admittedly with a 
hint of flashy style. 

I personally find the use of puns, 
colloquialisms and cultural references 
more objectionable, as they are likely to be 
understandable to only a fraction of Nature’s 
global readership. The English language 
is sufficiently complex without the need 
to understand these sometimes obscure 


THE SIMPSONS: FOX 


references in the headlines of Nature News 
stories and other similar articles. 

A quick scan ofa few issues yields: “...over 
a pork barrel”; “Oceanography: Churn, 
churn, churn’; “...science in premier league”; 
“State of the donation”; “Astrophysics: The 
answer is blowing in the wind”; “Scot on the 
rocks”; and “The silence of the robins”. As a 
native English speaker I may understand and 
appreciate these, but many others wouldnt. 
Jeff Craig 
Epigenetics Research Laboratory, 
The Murdoch Childrens Research Institute, 
Royal Children’s Hospital, Flemington Road, 


Parkville, Victoria 3052, Australia 


Puns: wimp or macho, nota 
particle of offence is meant 


SIR — Beyond the excellence of the 
scientific reporting, I particularly enjoy 
the entertaining use of language and the 
enlightened levity of Nature. Consequently, 
Iam somewhat concerned by the complaint 
of R. M. Ned and L.N. Steele (‘Slang’s not so 
slick when you remember its origins’ Nature 
447,775; 2007) regarding the use of the verb 
‘pimp; because of its “immoral origins”. 

Should I take offence because the use 
of the terms ‘wimp’ and ‘macho to denote 
putative particles might perpetuate negative 
stereotyping of my gender? 

The freedom and, especially, the humour 
of scientific reporting may be hindered 
by misguided attempts to avoid offending 
moralists. 
Milan Hopkins 
9479 Main Street, PO Box 638, 
Upper Lake, California 95485, USA 


Regions unite to challenge 
inequalities in Brazil 


SIR — The “unequal struggle” of medium- 
sized university physics departments in 
Scotland “against larger and more- 
entrenched rivals” is highlighted by 
your Editorial ‘All for one... (Nature 
447, 1031; 2007). The less developed 
countries of the world face an even 
greater problem. How can science 
help the economic improvement of 
these countries, taking into 

account not only “basic challenges 
in trying to compete globally’, 

but also regional difficulties — 
particularly very limited facilities and 
insufficient human resources? 

As an illustration of the regional 
problem, take the Human 
Development Index, which measures 
quality of life: a score above 0.5 is 
medium and above 0.8 is high. The south 


of Brazil scores about 0.84 — not far behind 
some European countries — whereas the 
northeast of Brazil is around 0.67. 

Along the same lines as the Scottish 
Universities Physics Alliance described in 
your Editorial, the Brazilian government and 
universities created the Northeast Network 
on Biotechnology, or Renorbio (www. 
renorbio.org.br) in 2003. Last year a graduate 
programme in biotechnology was added, 
consisting of a pool of 29 universities from 
10 different Brazilian states. 

This initiative is new in the following five 
ways. It aims to produce 100 PhD graduates 
a year, which should speed regional 
development. Many universities that were 
not strong enough previously to offer a PhD 
programme now can, and will also gain 
research funds. Researchers who join the 
programme qualify for PhD students. The 
universities are becoming more competitive 
both locally and internationally. They are 
now expected to produce more patents and 
publications, creating stronger departments 
and leading to an increase in the number of 
knowledge-based enterprises in the region. 
Luiz A. B. Castro*, Allan Kardec Barros} 

*R&D Policies and Programmes, 

Ministry of Science and Technology, 

Esplanada dos Ministérios, Bloco E, Brasilia, 

DF, 70067-900, Brazil 

+Department of Electrical Engineering, Federal 
University of Maranhao, Av. dos Portugueses, s/n, 
Sao Luis, MA, 65080-040, Brazil 


Summing up The Simpsons 


SIR — In your amusing News Feature 
interview with Harvard mathematics 
graduate Al Jean, head writer of The 
Simpsons (‘Mmm... Pi Nature 448, 404; 
2007), Jean mentions, as an example of a 
staggeringly obscure mathematical reference, 
a number in the thousands that is the sum 
of four squares. However, as 
I'm sure every Harvard 
mathematics graduate 
knows, every positive 
integer is the sum of four 
integer squares. I think 
you've been had! 
J. Ewart H. Shaw 
Department of Statistics, 
University of Warwick, 
Coventry CV4 7AL, UK 
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The true 
significance of 
8,208, the number 
referred to in the 
News Feature, is that it 
is one of only three 
four-digit numbers that 
are the sum of the fourth 
powers of their digits 
— Editor, Nature. 
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Drop ‘higher’ and ‘lower’ to 
raise descriptive standards 


SIR — For most of the past 2,500 years, 

the dominant view in the West was that 

life was graded from better to worse, higher 
to lower, in a Chain of Being. This could be 
the product of some creative deity or — as 
in the eighteenth- and nineteenth-century 
theories proposed by Denis Diderot, Jean 
Baptiste de Lamarck and Robert Chambers 
— aconsequence of matter somehow 
organizing itself through a goal-directed, 
progressive evolutionary process. Humans 
invariably occupied the top link. 

The scientific literature persists in 
providing a home for the terminology of the 
Chain. During the past year, for instance, the 
Nature.com search facility identifies more 
than 300 references to ‘higher’ or ‘lower’ 
eukaryotes or organisms. 

This usage is misleading, because evolution 
is not a progressive process. Selection may, in 
the short term, lead to increases in fitness. 
But there is no reason to assume that species 
adapted to the environments they occupy 
today are better adapted than their precursors 
were to their environments, and if evolution 
does not ensure this it hardly qualifies as 
progressive. Of course, because the first life 
forms were very simple, it is not surprising 
that evolution has resulted in increases in 
specialization or complexity in many lineages. 
But there is no sense in which such changes 
can make one taxon higher than another. 

An argument in favour of preserving the 
terminology of the Chain is that it is useful. 
Yet there is considerable disagreement over 
the referents of ‘higher’ and ‘lower’ The status 
of plants, for example, as higher or lower 
organisms is very fluid. Yeast is typically a 
‘lower’ organism when compared with 
animals but can bea ‘higher’ one when 
compared with bacteria. Perhaps more 
disturbingly, and in a usage reminiscent of 
natural theology, non-human mammals can 
be relegated to the group of lower organisms 
when they are compared with humans. 

A descriptive terminology hardly qualifies 
as useful if users disagree over what it 
describes. Moreover, it is rarely the case that 
using a consistent and scientifically robust 
terminology is more difficult than using the 
inconsistent and misleading terminology of 
the Chain. Distinguishing between plants 
and animals, for example, or between 
flowering and non-flowering plants, or 
between vertebrates and invertebrates, 
or between humans and other primates/ 
mammals/animals without resorting to the 
descriptors ‘higher’ and ‘lower’ is easy — 
it’s just been done. 

Michael Mogie 

Centre for Mathematical Biology, 
Department of Biology and Biochemistry, 
University of Bath, Bath BA2 7AY, UK 
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Four ways to take the po 


icy plunge 


How should researchers best interact with policy-makers for maximum benefit to society? 


The Honest Broker: Making Sense of 
Science in Policy and Politics 

by Roger Pielke Jr 

Cambridge University Press: 2007. 198 pp. 
$29.99, £15.99 


Andrew A. Rosenberg 

Few scientists are trained to be policy advisers. 
Many academics look askance at politics, as 
though there might be something unseemly 
about bringing their research to bear on prob- 
lems of the day. This is probably not so much 
from disinterest as from a lack of experience in 
real-world policy debates and of an academic 
incentive to participate in them. Happily, the 
book by Roger Pielke Jr on the engagement 
of scientists in policy offers a pithy, insightful 
basis for discussing the contributions scientists 
can make to advising policy-makers. 

Pielke sets out four basic modes for how 
scientists advise on policy and two models 
for scientific advice. The first mode is for the 
‘pure scientist, who is not involved at all. This 
is the colleague who shakes their head ruefully 
because the policy-makers were foolish enough 
not to read his or her recent papers. The second 
is for the ‘science arbiter, who readily responds 
to questions but without voicing an opinion 
on the policy choices. Again, this person 
sets themself apart from the problems of the 
policy-maker and is loath to take ownership 
of the results. 

The third role is the ‘issue advocate, who 
takes a clear position and argues for specific 
policy action, using scientific knowledge and 
training to make that argument. Finally, the 
‘honest broker’ — Pielke’s obvious preference 
— engages in the policy process to bring sci- 
entific information to bear by considering the 
options and helping policy-makers to choose. 
Here, there is real ownership of the process, 
and products of policy as options are explored 
from a scientific perspective. 

These caricatures of the roles that science 
advisers play provide useful scope for thought 
about how to interact with policy-makers. I 
have had experience with all four roles at vari- 
ous points in my career as both scientist and 
policy-maker, and agree with Pielke that scien- 
tists often make the most impact by serving as 
honest brokers. But his view of issue advocates 
is excessively disparaging. Although a scientist 
who takes a strong position on policy may not 
comport with the view of some that science is 
somehow ‘pure and above the messy fray of 


At the 2007 Intergovernmental Panel on Climate Change conference, scientists help shape policies. 


policy debates, such advocacy has had a hugely 
positive effect on policy. 

Issue advocates are to be welcomed and the 
scientific evidence weighed as it always should 
be, irrespective of who brings it to the table. It is 
too easy to label a scientist as an issue advocate 
to downplay their opinion without a rigorous 
reason. In my experience, most scientists who 
take the role of issue advocate do so out of con- 
viction resulting from their work, in much the 
same way as researchers will argue strongly 
for their interpretation of research results. It 
is therefore inappropriate to assume that issue 
advocates rely on science that is less rigorous. 

The strongest case that Pielke makes is this: 
“That some scientists engage in political activi- 
ties is neither new nor problematic; they are 
after all citizens. A problem exists when ... 
scientists implicitly or explicitly equate scien- 
tific arguments with political arguments, and 
in the process reinforce a simplistic and mis- 
leading view of how science supports policy.” 
The challenge is to separate carefully which of 
your inferences stem solely from the science, 
and which from your views as a citizen. 

The two models for science advice Pielke 
describes are the linear and the stakeholder 
models. The first is a simple progression from 
science to policy, with no real feedback; the 
second is an integrated system. I tired of Pielke 
railing against the linear model, and more so of 
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the implication that this model somehow leads 
to scientists unwittingly becoming issue advo- 
cates instead of honest brokers. Stakeholder 
involvement is essential, but can accentuate 
the tendency to polarize positions and for sci- 
entists to advocate one policy over another. 

Pielke’s choice of the basis for the Iraq war 
as an example of advisory processes, and how 
they fail, is a missed opportunity. An alter- 
native scientific case study would have been 
more relevant to the role that most scientists 
may play in policy-making. For example, the 
natural-resource issues that I have dealt with 
in fisheries and the marine environment have 
extensive advisory processes and stakeholder 
input. Scientists were engaged in these issues in 
all of Pielke’ four roles. The process is difficult 
and time-consuming, but it is likely to be much 
more relevant to the work of most scientists 
than the gathering of military intelligence. 

This is a clear, thought-provoking book that 
helps move us away from thinking of science as 
‘pure’ and distinct from policy. It would make 
an excellent basis for a graduate seminar. It isn’t 
a textbook, but a think-piece, and we all need to 
consider carefully our responsibility to engage 
as scientists in policy-making. a 
Andrew A. Rosenberg is professor of natural 
resources at the Institute for the Study of Earth, 
Oceans and Space, University of New Hampshire, 
Durham, New Hampshire 03824, USA. 
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Mankind's strange love of superweapons 


Doomsday Men: The Real Dr Strangelove 
and the Dream of the Superweapon 

by P.D. Smith 

Allen Lane/Penguin: May 2007. 

553 pp. with index, £20 

St Martin's Press: December 2007. 416 pp. 
$27.95 


Gregg Herken 

“There is nothing in Man’s industrial machin- 
ery but greed and sloth: his heart is in his 
weapons,’ said the Devil in George Bernard 
Shaw’s Man and Superman. Shaw's adage could 
almost be the leitmotiv of P. D. Smith’s well- 
researched and altogether depressing account 
of humankind’s long hunt for the ultimate 
superweapon: a doomsday device that, by its 
very terribleness, would make war forevermore 
unwinnable, and hence unthinkable. Although 
we all know how this tale turns out, it is a jour- 
ney well worth taking. Along the way, Smith 
includes some fascinating asides about the 
men — and it was, almost exclusively, a frater- 
nity — who, in seeking to make war obsolete, 
have only made it more deadly. 

Typical of the breed was German chem- 
ist Fritz Haber, who invented and personally 
promoted the use of poison gas on First World 
War battlefields. As with his contemporaries in 
the other belligerents, Haber started out as an 
idealist who, once hostilities began, quickly 
harnessed his science in the service of the 
state. His motto, “Im Frieden der Menschheit, 
im Kriege dem Vaterland” (In peace, for man- 
kind; in war, for the fatherland), sets a pattern 
followed with distressing regularity by other 
idealistic scientists in subsequent conflicts. 

The real focus of Smith’s book is Leo Szilard, 
a Hungarian-born American physicist who fled 
Nazi Germany and eventually wound up in the 
United States. Szilard was arguably the first to 
envisage how a nuclear weapon might be made. 
“You know what fission means,” he confided to 
fellow physicist Edward Teller in early 1939, “it 
means bombs.” 

Ironically, it was a seemingly off-hand remark 
made by Szilard in a nationwide radio broadcast 
in February 1950 — shortly after the Truman 
administration announced plans to proceed to 
the next level of destructiveness, the hydrogen 
bomb — that revived interest in a true ‘dooms- 
day’ weapon. Szilard envisaged building a bat- 
tery of very powerful hydrogen bombs jacketed 
with cobalt-60, an element with a radioactive 
half-life of 5.7 years. In a later iteration, the 
bombs would be hooked up to a computer, 
which, on receiving a certain signal — say, a 
single nuclear weapon exploding on friendly 
territory, or the crossing of a border by enemy 
troops — would detonate them. Theoretically, 
such a device, if set off, would enshroud Earth 
with lingering, lethal levels of radioactivity, 
snuffing out all human life. 

In the end, the cobalt bomb was never very 


868 


Leo Szilard was among the first to conceive of a nuclear weapon. 


centrepiece of his book. 

Doomsday Men is prodigiously 
researched, the author seeming to 
have read everything on nuclear 
strategy, both fiction and non- 
fiction (admittedly, the two are 
sometimes difficult to tell apart). 
However, there are some nagging 
deficiencies. Editing is one. We 
are told twice within a few pages 
that Leo Szilard was considered 
“the greatest scientist never to 
have won a Nobel Prize.’ Focus 
is another. The book is as much 
a history of modern science as 
of modern weaponry, so domi- 
nant has been the latter in the 
evolution of the former. But it is 
not clear, for example, what con- 
nection the discovery of X-rays 
has with the advent of doomsday 
weapons. 

One can only sympathize with 
the author’s observation that, 
since the end of the Cold War, 
global warming and Islamist 
terrorism have distracted our 


attractive to even the most bloody-minded 
of strategists because it would not have been 
destructive enough: the longer the half-life, the 
less intense the radiation. That did not stop a 
generation of novelists and film-makers from 
enlisting the cobalt bomb in their dooms- 
day scenarios: among them, novelist Nevil 
Shute in On the Beach and director Stanley 
Kubrick in Dr. Strangelove. Nor did it prevent 
P. D. Smith from making the cobalt bomb the 


attention from the weapons that 
remain in the arsenals of nations, numerous, 
primed and waiting. Although not as deadly as 
Smith's fictive doomsday bomb, they are cause 
for us to be more fearful, for they are real. 
Gregg Herken is professor of history at the 
University of California at Merced, PO Box 2039, 
Merced, California 95344, USA, and author of 
Brotherhood of the Bomb: The Tangled Lives and 
Loyalties of Robert Oppenheimer, Ernest Lawrence, 
and Edward Teller. 


Linnaeus lives on 


Order out of Chaos: Linnaean Plant Names 
and their Types 

by C. Jarvis 

The Linnean Society of London: 2007. 

1,016 pp. £80 


Pamela S. Soltis 
The relevance of taxonomy in our genomic era 
is greater than ever. Correct naming is crucial 
for developing new foods and medicines, and 
for understanding our changing environment. 
Amazingly, we do not even know how many 
species of flowering plant currently exist. 
Estimates range from about 250,000 to more 
than 400,000, and every day species are lost 
to human activities. Conservation, in natural 
habitats or botanical gardens, relies on scien- 
tific names, as does the rest of science. 

Before Carl Linnaeus, species names ranged 
from cumbersome to unwieldy. For exam- 
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ple, Arbutus caule erecto, 
foliis glabris serratis, baccis 
polyspermis was as goodas it 
i. got for ‘Arbutus with upright 
ipaerys | stems, hairless, saw-toothed 
300. leaves, and many-seeded 
—.__ berries’ Linnaeus’s binomial 
system of nomenclature changed all that. Spe- 
cies could now have a genus name plus a spe- 
cific epithet. “The father of taxonomy’ applied 
his revolutionary system to plants in Species 
Plantarum, published in 1753, and to animals 
in the tenth edition of Systema Naturae in 1758. 
More than 250 years later, the name ofa species 
remains the key that unlocks all that is known 
about it. 

In honour of Linnaeus’s 300th birthday, 
Charlie Jarvis’s handsome new book, Order 
out of Chaos, gives us a glimpse into pre- 
linnaean botany, Linnaeus’s life and work, 
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and the reception and impact of his ideas in 
eighteenth-century Europe. 

Linnaeus was an avid collector and classi- 
fier of ‘animals, vegetables and minerals. He 
named and described his collections and the 
many specimens that were returned to Europe 
from exploration of other parts of the globe. 
His plant names are the starting point for valid 
names that follow botanical nomenclature. To 
be valid, a species name must be attached to 
one reference or ‘type’ specimen. This con- 
cept was implemented 150 years after Species 
Plantarum, so — ironically — most of the 
names assigned by Linnaeus himself do not 
have clearly associated specimens. 

Enter the Linnaean Plant Name Typifica- 
tion Project, launched in 1981 at the London 
Natural History Museum to match Linnaeus’s 
thousands of designations with the specimens 
and illustrations in his work. Jarvis’s book 
chronicles this project and presents nearly 
700 pages of names and their types — from 
Abrus precatorius L. to Zygophyllum spinosum L. 
Jarvis and his colleagues scoured museums and 
private collections to create this wonderful 


One of many plant species named by Linnaeus. 


reference for all those involved in taxonomy. 

The book is a work of art. Even non-botanists 
will be seduced by this inviting package. The 
text is peppered with photographs and illustra- 
tions of spectacular examples of plant diversity 
and linnaean memorabilia. Figures range from 
a photograph of Linnaeus’s signature and seal 
from 1734, to an illustration of the wild wood- 
land twinflower (Linnaea borealis), to Georg 
Dionysius Ehret’s (1736) illustration of Linnae- 
us’s ‘Sexual System of plant classification. 

The appreciation for history in plant system- 
atics is evident in this book and in the attention 
being paid to Linnaeus during 2007. Even the 
controversy between those who favour the con- 
tinued use of linnaean classification and advo- 
cates of the PhyloCode (the system that relies 
on evolutionary history) does not diminish 
Linnaeus’s contribution. This book is a timely 
homage to linnaean names and the efforts of 
Jarvis’s team, as systematists look for new ways 
to provide the richest classification systems. ™ 
Pamela S. Soltis is at the Florida Museum of 
Natural History, University of Florida, Gainesville, 
Florida 32611, USA. 


The theatre of quantum physics 


Faust in Copenhagen: A Struggle for the 
Soul of Physics 

by Gino Segré 

Viking/Jonathan Cape: 2007. 320 pp. 
$25.95/£20.00 


Finn Aaserud 

The year 1932 was a particularly eventful 
one in physics. In Faust in Copenhagen, the 
physicist Gino Segré chooses as his point of 
departure an annual meeting held in April of 
that year at Niels Bohr’s Institute for Theoreti- 
cal Physics at the University of Copenhagen, 
Denmark. In Segré’s view, the 1932 gathering 
symbolized the end of the political neutral- 
ity of physics and physicists, coinciding as it 
did with the arrival of Hitler and crucial dis- 
coveries in nuclear physics that would make 
possible the subsequent development of the 
atomic bomb. 

Segré introduces us at the outset to seven 
physicists who attended this series of infor- 
mal but prestigious meetings: Niels Bohr, Paul 
Dirac, Werner Heisenberg, Paul Ehrenfest, 
Max Delbriick, Lise Meitner and Wolfgang 
Pauli. He presents a fascinating compari- 
son of these main characters in the first 
hundred pages, and then launches into a 
spirited and engaging history of the develop- 
ment of quantum physics from about 1900 
to 1932. Segré’s emphasis is always on the 
human aspect, and he is unable in this part 
of the book to limit his story to the seven 
physicists whom he claims the book is about. 
Towards the end, he returns to the original 
group, vividly describing their subsequent 


careers, again with illuminating comparisons. 

As the nephew of Emilio Segré, a Nobel 
prize-winning physicist trained by some of the 
main characters in the book, Segré deploys his 
background effectively for dramatic and com- 
parative purposes. This, however, shortens his 
distance from the subject matter — for exam- 
ple, the role of experimental physics at Bohr’s 
institute is underplayed. Moreover, he seem- 
ingly contradicts his interpretation of the 1932 
meeting as the end of the political neutrality of 
physics, by praising his own field for continu- 
ing to this day in the disinterested tradition 
instigated by Bohr. 

Bohr’s crucial contributions to quantum 
mechanics, Segre writes, were “largely ignored” 
by Segré’s own generation of physicists, and 
he seems intent on rectifying the situation. 
Whether or not this is necessary, Segré’s 
approach reveals important insight into Bohr 
as a person as well as into his interaction with 
and importance for physicists who followed 
him. 


At the end of the 1932 meeting, in celebra- 
tion of the hundredth anniversary of Johann 
Wolfgang von Goethe's death, some of the 
younger scientists performed a parody of 
Faust, replacing Goethe’s original characters 
with some of the main theoretical physicists 
of the day, most of whom were also assembled 
in Copenhagen: Niels Bohr was portrayed as 
God, Wolfgang Pauli as the Devil and Paul 
Ehrenfest as Faust. The parody described the 
state of physics at the time, and Segré regards 
it as sufficiently important to name his book 
after it. 

At least to me, however, Segré does not inte- 
grate the play into his narrative satisfactorily. 
The worldwide success of Michael Frayn’s play 
Copenhagen, first set up in London in 1998, 
inspired a still-active interest in the relation- 
ship between science and the theatre. Frayn’s 
play even shares two of Segré’s main charac- 
ters — Niels Bohr and Werner Heisenberg. 
But Faust as 
enacted at the 
1932 meet- FP 5 
ing was writ- 
ten not by a 
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Wolfgang Pauli as the Devil in a parody of Faust, performed by physicists at a meeting in 1932. 
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professional playwright but by the participant 
physicists themselves, and one might have 
hoped that Segré would have used the oppor- 
tunity to offer a new twist to the relationship 
in question. Admittedly, there are quotations 
from the original Faust at the head of each 
chapter, as well as a chapter entitled ‘Goethe 
and Faust’ and another comparing the physi- 
cists’ play with Goethe's original. But these 
instances are provided largely in isolation from 
the main narrative of the book. 

In 1932, German was still the main language 


of physics — and culture — so the choice of 
Goethe's Faust as a basis for a satire enacted 
in German seems natural. Probably in antici- 
pation of his likely readership, Segré does not 
quote from the original German, but quotes 
instead from necessarily inferior English trans- 
lations. The many quotations from Goethe's 
Faust are taken from Anna Swanwick’s excel- 
lent, but by now somewhat archaic, translation 
from late in the nineteenth century. Likewise, 
Segré’s treatment of the physicists’ Faust is 
based on the spirited, but sometimes inaccu- 


rate, English translation published in 1966 in 
a highly personal account, Thirty Years That 
Shook Physics, by the physicist George Gamow 
(to whom Segre dedicates his book). 

The photographs used to illustrate Faust in 
Copenhagen are well chosen. Apart froma few 
minor omissions and factual errors, the book 
is lively and readable and provides an exciting 
impression of the development of theoretical 
physics during a crucial period. a 
Finn Aaserud is director of the Niels Bohr Archive, 
Blegdamsvej 17, DK-2100 Copenhagen, Denmark. 


Medical history without frontiers 


Medieval Islamic Medicine 

by Peter Pormann and Emilie Savage- 
Smith 

Georgetown University Press/Edinburgh 
University Press: 2007. 223/256 pp. 
$29.95/£18.99 


Yasmin Khan 

What is ‘Islamic medicine’? It is a term fraught 
with limitations. No definitive definition exists, 
but most reputable historians of medicine have 
characterized it by several key features. These 
include a chronological marker that spans a 
medieval period of a thousand years and 
encompasses specific but vast geographi- 
cal regions from Spain and North Africa in 
the west, to central Asia and India in the 
east. Its successes were inspired implic- 
itly by the Islamic faith, but sometimes 
involved discreet collaboration of Muslims 
with non-Muslims, who used Arabic as 
the lingua franca to cultivate the resulting 
phenomenon. 

These idiosyncratic dynamics of Islamic 
medicine are explored in Medieval Islamic 
Medicine. With their meticulous research on 
newly translated manuscripts, two pre-emi- 
nent scholars of medical history, Pormann 
and Savage-Smith, have revisited the topic 
discussed in the classic text Islamic Medi- 
cine by Manfred Ullmann, published almost 
30 years ago. 

Their new, brief overview is destined to 
become a definitive entry-level textbook, 
owing in part to its simple structure, compre- 


hensive analysis and succinct interpretation of 


key historical figures and their achievements. It 
relies on a thematic rather than chronological 
approach, which enhances its readability. 

The book discusses the emergence of medie- 
val Islamic medicine and the subsequent cross- 
pollination with medicine in other cultures. We 
are given a sense of how Muslims perceived the 
body as well as the soul as precious, because it 
was derived from and accountable to God as 
the creator. The body therefore required con- 
stant and dutiful care, and had the right to be 
maintained and preserved and protected from 
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abuse, meaning that the individual was bound 
to seek a cure for any ills. These sensibilities 
were influenced by precursor civilizations — 
particularly ancient Greece — and reinforced 


by the Islamic faith. 


Pormann and Savage-Smith refute the cli- 
chéd view that the Islamic medieval civilization 
was merely a bridge between ancient and mod- 
ern medicine. They show how Muslims built 
on Greek knowledge, in addition to preserving 
and translating it. The resulting new ideas and 
theories later percolated further into the West 
through translation, trade and travel, as well as 


A Persian physician gives instructions for preparing 
smallpox remedies in the eleventh century. 


and linear approach that besets many texts 
and museum displays of medical history, 
which often portray the past as inferior and 
the present as the benchmark of progress. And 
without taking the ‘great man of history’ line, 
they give us Islamic medical pioneers aplenty 
— for example, Abu al-Qasim al-Zahrawi 
(Albucasis). This accomplished surgeon from 
eleventh-century Muslim Spain first described 
and used forceps for extracting fetuses during 
stillbirths, developed other specialized surgical 
tools, and sutured wounds using cat gut. 
Pormann and Savage-Smith also discuss the 
role of women in the development of medi- 
eval Islamic healthcare. Patronage by wealthy 
women associated with those in power proved 
instrumental in providing the financial invest- 
ment needed for establishing hospitals in 
Baghdad from the ninth century. These 
institutions offered free, specialized treat- 
ments, which resembled the founding ethos 
of the modern British National Health 
Service. 

What led to the demise of the strong- 
hold of Islamic medicine in general is not 
addressed in the book: this is an unresolved 
and complex question. Some blame religion, 
or the lack of it, and others point the finger 
at social factors such as war and colonial- 
ism. This subsidence has been perpetuated 
and exacerbated by the ‘brain drain’ phe- 
nomenon. That said, the legacy created by 
the practice of Islamic medicine is alive and 
thriving in the rural Middle East, through- 


during the crusades, acting as a springboard 
towards the European renaissance and thus 


forming the roots of modern medicine. 


The book emphasizes the inclination of early 
Muslim medical practitioners towards Unani 
Tibb, a form of Islamic medical tradition stem- 
ming from early Greek medicine. This involved 
balancing, through diet and medicinal herbs, 
the four humours — air, earth, fire and water, 
which correspond to the four bodily fluids 
blood, phlegm, black bile and yellow bile — and 
understanding that a good environment and 
spiritual peace are essential for good health. 
Today such a holistic approach to well-being, 
although experiencing a resurgence, is outside 


mainstream modern medicine. 


The authors avoid the irksome cumulative 
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out Africa, among the Muslims of southern 
Asia and in Asian diaspora communities in 
Europe and the United States, indicating that 
there is scope for a wider revival. 

Medieval Islamic Medicine is a measured 
and scholarly book that is a timely rebuke to 
those tempted to dismiss the past as irrelevant. 
The medieval period when the Islamic world 
drove medical development and discovery is 
an indication that Islam itself is not inimical 
to progress. This book should be read by any- 
one interested in the real story of civilization. 
It is a notable contribution to the promotion 
of a multicultural public understanding of the 
history of science. a 
Yasmin Khan is exhibition coordinator at the 
Science Museum, Exhibition Road, London SW7 
2DD, UK. 
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Obsessed with grooming 


Steven E. Hyman 


Roughly 2% of humans suffer from obsessive compulsive disorder, but a lack of animal models has impeded 
research into this condition. Could a genetically engineered mouse model provide an exciting lead? 


Obsessive compulsive disorder (OCD) is one 
of a spectrum of disorders characterized by 
obsessions (intrusive, unwanted thoughts) and 
compulsions (ritualized behaviours intended 
to overcome the anxiety and tension resulting 
from the obsessions). Other similar conditions 
include Tourette's syndrome; sufferers with this 
disorder show several motor, and occasionally 
vocal, tics, often accompanied by obsessions 
and compulsions. Although the evidence is 
less convincing, another disorder that might 
fit into this spectrum is trichotillomania, the 
hallmark of which is compulsive pulling out 
of scalp hair’. On page 894 of this issue, Welch 
et al.” describe a genetically engineered mouse 
that shows some behavioural features similar 
to those of the obsessive compulsive spectrum 
of disorders in humans’. 

Often beginning early in life and running 
a chronic and relapsing course, OCD causes 
significant distress and disability. The avail- 
able treatments for this condition are only 
moderately effective. They include cognitive 
behavioural psychotherapies and antidepres- 
sant drugs that increase levels of the neuro- 
transmitter serotonin at synapses (junctions 
between neurons). As is often the case in psy- 
chiatric disorders, the existing drugs for OCD 
stem from exploitation of serendipitous clinical 
observations. Neither the pathophysiology of 
the disorder nor the mechanism of action of 
these drugs is understood. 

Given the toll of OCD and other psychiatric 
disorders, safer and more effective medica- 
tions are much needed; but four main obstacles 
remain. First, understanding the neurobiol- 
ogy of higher cognition, emotion and con- 
trol of complex behaviour is still a daunting 
frontier. Second, those with psychiatric disor- 
ders lack obvious and visible signs of damage 
to the nervous tissue, the presence of which 
could point the way to identifying molecular 
culprits. Third, the genetic and non-genetic 
factors contributing to psychiatric disorders 
are highly complex*. And fourth, good animal 
models have been lacking. 

The mice studied by Welch et al.” showed 
excessive grooming, which resulted in hair loss 
and skin injuries, as well as anxiety-like traits. 
These mice lack the gene encoding SAPAP3 


Compulsive hand-washing is a 
common symptom of obsessive 
compulsive disorder. 


—a scaffolding protein that is found in exci- 
tatory, glutamate-responsive synapses and is 
highly expressed only in the striatum region 
of the brain. The behavioural abnormalities in 
these mice were reversed by local expression of 
Sapap3 in the striatal region, which indicates 
that loss of this gene is responsible for the 
observed behavioural abnormalities. 

The authors also found that a drug from 
the SSRI class — which selectively enhance 
serotonin-mediated neurotransmission 
throughout the brain — that is used to treat 
OCD in humans decreases both grooming and 
anxiety in these mice. This is interesting 
because a condition responsive to an enhancer 
of serotonin neurotransmission does not sig- 
nify a primary defect in serotonin-mediated 
signalling; instead, the defect is in glutamate- 
responsive synapses. So alterations in serotonin 
seem to modulate glutamate action. These 
findings are also noteworthy because Welsh 
and colleagues have generated a possible mouse 
model of OCD. Moreover, these observations 
add to the accumulating, if circumstantial, evi- 
dence that OCD and its associated disorders 
result from abnormalities in neural circuits 
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spanning the frontal, striatal and thalamic 
regions of the brain. 

Studies involving structural neuroimag- 
ing of the striatum in the brains of patients 
with OCD”* have often indicated abnormali- 
ties in the volume of striatum components, 
although they do not agree on the nature of 
these changes. However, a long-term study’ 
of patients with Tourette's syndrome suggests 
that the relationship between the volume of the 
caudate nucleus (a component of the striatum) 
and the symptoms of this disorder is complex; 
at the time of the scan (childhood), smaller 
caudate volumes did not correlate with symp- 
toms, but they did predict persistence of severe 
symptoms into early adulthood. 

Functional neuroimaging studies® on 
patients with OCD, based on cognitive and 
emotional tasks, also point to abnormalities in 
frontal-striatal-thalamic circuits, although we 
cannot yet claim to be on firm ground. That 
said, a striatal origin for OCD-like symptoms 
would make a lot of sense. The medium spiny 
neurons of the striatum receive convergent 
glutamate-mediated inputs from the cerebral 
cortex, which provide detailed information 
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about the context in which behaviours occur. 
These neurons also receive dopamine-medi- 
ated inputs from the midbrain that report on 
the significance of the behaviour — for exam- 
ple, whether it yields an unexpected reward. 

When something important happens, the 
striatum stimulates circuits that project via 
the thalamus to the prefrontal cortex; this 
leads to the formation of memories that can 
later guide the planning and control of behav- 
iour. Abnormalities in synaptic function within 
frontal-striatal-thalamic loops could lead to 
unintended behaviours or even unintended 
thoughts. The main significance of the Sapap3- 
deficient mouse is that it represents a model 
of how such a pathological process might 
work. In this case, abnormal synaptic function 
causes grooming behaviours that are repetitive 
enough to produce skin injuries and unrespon- 
sive enough to control by the prefrontal cor- 
tex that they continue even after the injuries 
have occurred. 

The mouse model described by Welsh and 
colleagues will excite those interested in the 
pathophysiology of OCD or in developing 
treatments for it. But we must retain a healthy 
caution. Genetic animal models of psychiatric 
disorders have been produced by inbreeding for 
desired traits’, by introducing human disease 
genes’’, by mutating existing animal genes" 
or, as in this case, by observing an unexpected 
behavioural response in a genetically engi- 
neered animal. It is highly unlikely that such 
animal models will ever recapitulate human 
psychiatric disorders in their entirety. However, 
as in this case, it is possible to model significant 
aspects of these diseases. 

The core symptom of OCD — unwanted 
intrusive thoughts — cannot be mimicked in 
mice, at least not in any obvious way. Instead, 
Sapap3-deficient mice show excessive groom- 
ing that continues past the point of self-harm. 
This is reminiscent, perhaps, of compulsive 
hand-washing, which in patients with OCD 
represents a never fully successful attempt to 
neutralize fears of contamination. 

In OCD patients, the main cause of anxiety 
is the unwanted intrusive thoughts. The suffer- 
ers are anxious because they cannot be certain 
that the door is locked, the gas has been turned 
off, or that they are free of dreaded microbes. 
The anxiety-like behaviours observed in these 
mice may also resemble OCD, but this requires 
a stretch of the imagination. 

Even if we can gain assurance with addi- 
tional research that the behaviours observed 
in Sapap3-deficient mice reflect abnormalities 
in circuits that produce human symptoms, we 
cannot assume that OCD-related conditions in 
humans involve variations in this gene. These 
disorders, like other major psychiatric dis- 
eases, seem to be heterogeneous with complex 
underpinnings — probably involving several 
genes — that, in interaction with developmental 
and environmental factors, could lead to abnor- 
malities in frontal—striatal-thalamic circuits. 

Despite these reservations, the work of Welsh 


872 


et al.” sharpens our focus on frontal-striatal- 
thalamic circuits both in human patients and 
in animal models of OCD. It also gives us a 
compelling clue that the compulsive behaviour 
associated with this condition is due to a syn- 
aptic abnormality in these neural loops. Such 
cellular insight should aid an understanding of 
OCD at a molecular level. a 
Steven E. Hyman is in the Department of 
Neurobiology, Harvard Medical School, 
Massachusetts Hall, Cambridge, Massachusetts 
02138, USA. 

e-mail: seh@harvard.edu 
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QUANTUM PHYSICS 


Wave goodbye 


Luis A. Orozco 


When measuring photons, it's a case of ‘wanted, dead’ — catching them 
alive is not an option. But we can observe how a superposition of many 
photon waves progressively collapses as it interacts with a beam of atoms. 


Earlier this year, a team from the Ecole Nor- 
male Supérieure in Paris recorded jumps of 
light heralding the birth and death of a pho- 
ton trapped in a cavity’. As they describe in 
this issue (Guerlin et al., page 889)°, the same 
researchers have now performed a similar, 
more complex trick — recording exactly how 
a coherent state of many photons collapses as 
it is measured. 

A measurement process differs fundamentally 
between the classical and quantum worlds. In 
the classical realm, there is no explicit limitation 
ona measurement’s accuracy. In the quantum 
domain, by contrast, accuracy is constrained 
by the Heisenberg uncertainty principle: a 
measurement will produce a definite result, 
but one whose value is distributed according to 
the laws of probability. What is more, the meas- 
ured object will itself be fundamentally altered 
by the measurement. Thus, the clicking sound 
produced when a photon is caught by a detector 
says two things: yes, a particle was detected; but 
sorry, the way you detected it killed it, and its 
energy was converted into an electric pulse. 

But the quantum world has more subtle 
states to investigate than a single photon. Pho- 
tons, or the probabilistic wavefunctions associ- 
ated with them, can add together, or superpose. 
If they superpose coherently (in phase), their 
combined wavefunction begins to look like a 
classical wave. This coherent electromagnetic 
field is the complex beast whose collapse was 
monitored by Guerlin et al.’. 

But how did they achieve this feat, given the 
difficulties of measuring a quantum object 
without instantly destroying it? The authors’ 
‘quantum non-demolition measurements in 
a cavity quantum-electrodynamical (QED) 
system required profound understanding of 
quantum mechanics, continuous theoretical 
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elucidation of subtle details of cavity QED, 
and unprecedented dedication in realizing a 
simple theoretical model in the laboratory. 
This model’ first required the development of 
a pair of superconducting mirrors for the walls 
of the cavity whose losses are low enough that 
light remains captured between them for the 
length of time it would take the light to circle 
Earth at the Equator. 

The second pivotal ingredient is individual 
rubidium atoms in a ‘Rydberg’ state in which 
one electron is highly excited. These atoms are 
like little planetary systems, with the excited 
electrons on a distant orbit around a remote 
atomic nucleus. They can oscillate between 
two different excited states, and the regularity 
of this oscillation makes them excellent time- 
keepers. The frequency of that oscillation is 
easily disturbed in the presence of light — to 
the extent that it can be used to detect the pres- 
ence ofa single photon non-destructively’. 

And so Guerlin et al.” prepared a coherent 
state of a microwave electromagnetic field in 
their cavity that contained up to seven photons, 
and then sent in the Rydberg atoms. They made 
sure that these atomic clocks had their ‘hands’ 
— their initial phase — set to one of eight pos- 
sible values. Each of these settings is a possible 
answer to the question “How many photons are 
there in the cavity?” The interaction with the 
electromagnetic field modifies the setting of 
the hands according to the answer, from none 
to seven. The atoms leave the cavity, and the 
new direction of the hands is recorded. 

Guerlin and colleagues’ measurement 
method had to fulfil certain conditions for 
success’. First, the interaction of the quantum 
probe (the atom) with the remaining, classi- 
cal part of the measuring device (the ‘click’ 
of the detector) must not begin until the 
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Figure 1| State surveillance. a, As a stream of rubidium atoms passes through a mirror-walled cavity 
filled by a coherent light field in Guerlin and colleagues’ experiment’, the atomic ‘clocks’ are disturbed 
according to the number of photons present in the light field. The probability for different photon 
numbers is initially evenly distributed between the possible outcomes (none to seven photons), but 
‘collapses’ to a definite value as more and more atoms pass through — to five, in this case. b, A different 
measurement of the same state collapses to a different photon number — seven. ¢, By repeating the 
same measurement over and over again, an overall (poissonian) probability distribution for the 
number of photons in the initial coherent light field can be gathered. (Figure modified from ref. 2.) 


interaction of the atom with the electromag- 
netic field is complete. Second, the interaction 
with the classical part must not contribute 
significantly to the total error of the measure- 
ment. Finally, there should be no fundamental 
constraint on the precision of a measurement: 
in quantum-mechanical terms, the phase of the 
electromagnetic field (which according to the 
uncertainty principle is the ‘canonical conjugate’ 
of the number of photons) should be maximally 
perturbed by the measurement in order to 
measure the photon number accurately. 

Each individual measurement in this pro- 
cess gives only one answer about the number 
of photons in the coherent state. As the meas- 
urement process is started, these answers are 
uniformly (randomly) distributed among 
the possible values, but as the measurement 
progresses and more and more atoms are 
sent through the light field, the answer con- 
verges (collapses) to a particular value. Start- 
ing the process all over again with a freshly 
trapped, but identically prepared, light 
field, the measurement converges again to 
a different number. By repeating the meas- 
urement process again and again, the distri- 
bution of the number of photons in the initial 


coherent state can be reconstructed (Fig. 1). 
The ability to follow a quantum measure- 
ment stage by stage as the wavefunction col- 
lapses opens up broad avenues of approach to 
fundamental questions that were previously 
accessible only in simulations. The process of 
measurement might now be manipulated to 
steer a system to a particular final state using 
quantum feedback or to explore conditional 
quantum dynamics — using the outcome of a 
measurement to determine a systems evolution. 
The Parisian team is uniquely placed to advance 
farther into the quantum jungle, a largely unex- 
plored area where new species of Schrédinger’s 
cat rub shoulders with exotic ‘quantum butter- 
flies’ — flighty states that flutter between values 
when disturbed by measurement, only to settle 
down to rest again. a 
Luis A. Orozco is at the Joint Quantum Institute, 
Department of Physics, University of Maryland, 
College Park, Maryland 20742, USA. 
e-mail: lorozco@umd.edu 
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50 YEARS AGO 
On the Beach by N. Shute — 

THE theme of “On the Beach” 

is the extinction of the human 
race resulting from an atomic 
war. Everybody dies. Just 

that. When the novel opens in 
Melbourne, 1962, nobody is alive 
inthe northern hemisphere. 
Movements of the atmosphere 
are steadily carrying lethal 
particles southwards. Mr Shute 
has deployed his remarkable 
imagination as engineer, naval 
officer and storyteller... On his 
immense popular following this 
book can only inflict a haunting 
distress: one takes off one’s hat 
to him. Mr Shute has limited 

his canvas: he has taken only 

five main characters—five very 
ordinary people at that; and he 
has pitched the emotional and 
dramatic tone invariably low. Until 
the moment when the radiation 
sickness comes on, everybody 
sticks to the tamest of domestic 
preoccupations; and then quietly 
takes a suicide pill. The effect is 
hypnotic and also odd. Mr Shute's 
world ends not, as the epigraph 
warns, with a whimper, but 

with a stoical silence, movingly 
impressive... The moral in so far 
as Mr Shute states it explicitly, 
comes via his characters thus: 
“Maybe we've been too silly to 
deserve a world like this”... 

From Nature, 24 August 1957. 


100 YEARS AGO 

According to Engineering, an 
Australian record in wireless 
telegraphy has been achieved by 
the successful transmission of 
messages from H.M.S. Challenger, 
one of the Australian squadron 

at present stationed in Hobson's 
Bay, to the flagship Powerful, 
which at the time was moored 

in Farm Cove, Port Jackson. The 
Challenger was in communication 
with the flagship by means of 
wireless telegraphy the whole of 
her voyage. The longest message 
was one flashed over a distance of 
410 miles ina direct line, and this 
constitutes an Australian record, 
as previously never more than 
240 miles had been achieved by 


warships on the Australian station. 


From Nature, 22 August 1907. 
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Age, health and wealth 


Frances Cairncross 


Ageing populations raise the spectre of crippling healthcare costs. Extra 
spending on medical research might bring healthier, happier older people 
who work (and pay taxes) for longer. Is that a good investment? 


Many of the changes that will dominate the 
first half of the coming century are uncertain, 
but one is not: the global population will grow 
older. Already those aged 65 or over, who have 
rarely accounted for more than 2-3% of the 
populations of most countries, make up 15% 
of the rich world’s inhabitants. The propor- 
tion is also rising in poor countries, as women 
in many nations have fewer children, and life 
expectancy rises, not just in infancy but in late 
middle-age and even in old age. 

Hallmark Cards now sells about 85,000 
100th-birthday cards a year, in three differ- 
ent designs. Reaching the century is all very 
well — as long as it does not mean spending a 
quarter of a century incapacitated or in a nurs- 
ing home. So one of the most pressing issues 
about the rise in life expectancy at older ages is 
whether more years of life mean more years of 
disability or more years of good health. 

This matters not only for those with grey- 
ing hair, but also for governments. A conse- 
quence of falling fertility rates is that there will 
be fewer young people relative to the number 
of old people. One result is an upward creep in 
the average age of the labour force; another is a 
fall in the proportion of workers to non-work- 
ing old folk. The upshot is fewer taxpayers, but 
also more people who are not contributing to 
the economy and are likely to draw increas- 
ingly on health services. If the books are to 
balance, people will need to work for longer. 
But will their health be good enough for them 
to do so? And ifit is, will they be as productive 
as younger folk, or will an older workforce in 
industrialized countries lose its competive edge 
against industrializing countries that still have 
youthful employees? 

These matters are the starting point for a 
new paper’ on labour-force participation, 
increases in human capital and the implica- 
tions for spending on research. The lead author 
is Kenneth Manton, who has revolutionized the 
conventional wisdom on health in old age. He 
is a mathematical demographer who ran the 
National Long Term Care Survey (NLTCS), a 
survey that began in 1982 and is designed to 
provide information about the population of 
chronically disabled elderly people in the United 
States. As Manton began to publish results””, 
the initial response was one of incredulity: con- 
trary to the prevailing wisdom of gerontologists, 
who thought that modern medicine would not 
cure or prevent the chronic diseases of old 
age, the results showed that chronic disability 


If the economic ie areito 
balance, people will needto 
work for longer. 


among over-65s is declining, and at a gradually 
accelerating pace. The most recent NLTCS 
shows that the pace of decline has increased 
from 0.6% in 1982 to 2.2% in 1999-2004. 

Will this transformation continue? If it does, 
the savings in both human misery and govern- 
ment expenditure will be tremendous, as will 
the benefits to the world economy. Instead of a 
growing population of disabled elderly in need 
of expensive long-term care, many countries can 
look forward to an army of spry, intelligent and 
hard-working old folk. The role model might be 
Alan Greenspan, who went on running the US 
central bank until he was almost 80. 

Manton and his colleagues are optimists. 
Health quality is increasing faster than life span, 
they argue, and has been on the rise for much 
of the past century. The value of this improve- 
ment to individuals and to the economy can 
be quantified. There is every reason to believe, 
they say, that the improvement can continue. 
Many of the gains in both physical and mental 
health of the elderly result from innovations 
in medical care. If this process continues, the 
economic gains will grow still larger, greatly 
reducing worries about the financial burden 
of an ageing society. 

Therefore — and this is the nub of the authors’ 
argument — there are strong financial reasons 
for governments, and especially the US govern- 
ment, to support hefty investment in medical 
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research. Even if only 30% of those old people 
with better health choose to continue working, 
the authors conclude that the economic gains 
overall would be sufficient to support a doubling 
of investment in biomedical research over five 
years, and a quadrupling over a longer period. 

This cheerful conclusion raises several ques- 
tions, of which [ll mention two. The first is 
whether Manton and colleagues’ optimism is 
justifiable. Some recent research raises doubts, 
driven mainly by swelling waistlines in the 
United States. For instance, the initial results 
of another, and newer, long-term survey, the 
Health and Retirement Study, sponsored by the 
US National Institute of Aging, provides a view 
not only of people aged 65 and over (as in the 
NLTCS) but of the older middle-aged. Analys- 
ing its results, Cutler and colleagues* conclude 
that current trends in obesity could reverse 
about one-third of the gains in life expectancy 
within the next 30 years. 

On the other hand, there clearly remains 
much scope for medical advance. For a review 
of potential biomedical advances by the Cent- 
ers for Medicare & Medicaid Services, panels of 
technical experts were asked to identify possi- 
ble breakthroughs that would affect the health 
and life expectancy of the elderly. They came 
up with a list of 33 that are likely to become 
commonplace during the next two decades. 
Even obesity may turn out to be less of a chal- 
lenge than it now seems. The proportion of 
overweight people will not rise for ever; some 
of us will manage to keep our figures as we age. 
And the consequences of obesity are becoming 
more treatable as the years go by. 

The second question is whether Manton et 
al. are right to put so much emphasis on the 
economic benefits to be gained from medical 
research. One of Manton’s collaborators (and 
a co-author of this paper) is Arthur Ullian, a 
wealthy and driven advocate of more spend- 
ing on health research, who gave up a career in 
property after a serious accident left him in a 
wheelchair. The paper’ sounds a bit like a lob- 
bying document for the US Senate appropria- 
tions committee, a body that has sometimes 
taken the view that expensive health interven- 
tions merely drive up costs. 

Nonetheless, the argument that the returns 
on investment in medical-care technology and 
pharmacology have so far easily outstripped 
their costs is persuasive. In particular, the 
development of statins for reducing blood 
cholesterol levels must be one of the most 
economically beneficial inventions of the past 
three decades. But medical research is surely 
worth conducting for the improvements it 
brings to human happiness, as well as to pro- 
ductivity. And it is not the only way to extend 
life expectancy. Changing people's behaviour 
is also enormously important. 

The fall in the number of people smoking and 
the rise in education have been two powerful 
influences on healthier life expectancy. But the 
well-educated have enjoyed much larger gains 
both in life expectancy and in health in old age 
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than have the less educated — and the latter 
find it hardest to abandon their cigarettes. If 
one is talking about cost-effective ways to keep 
even more old folk sprightly, then the best bet 
may be to improve the education of the poor 
and to persuade them to stop smoking. a 
Frances Cairncross is at Exeter College, 

University of Oxford, Oxford OX13DP, UK. 
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MATERIALS SCIENCE 


Stirring stuff 


David J. Pine 


Take silicon, soak in water, add acid — and stir. This simple new recipe for 
the self-assembly of complex microstructures belies an involved sequence 
of hydrophobic, electrostatic and van der Waals interactions. 


In folding its proteins and constructing its com- 
plex membranes, nature uses self-assembly: 
bathed in water or another liquid, tiny building- 
blocks come together by virtue of their shape 
and interactions. As they report in the journal 
Small, Onoe et al.' adapt these natural processes 
for their own designs. They describe a method 
for assembling parts just 10 micrometres across 
into complex, three-dimensional objects, and go 
on to build up chains of interlocking rings. The 
research is another step towards the ultimate 
goal of building electrical or optical circuits, or 
even microscopic machines, from components 
at micrometre and smaller scales. 

The folding of a protein molecule from a 
long, linear sequence of linked amino acids is 
one of nature’s more spectacular demonstra- 
tions of self-assembly. In a watery environment, 
certain amino acids along the chain can attract 
each other by various means — van der Waals 
attraction, hydrogen bonding or hydrophobic 
interactions. Others might repel each other 
through their electrical charges or hydrophilic 
interactions. Geometrical constraints limit 
which amino acids along the chain can inter- 
act with each other, so which amino acid occu- 
pies a given position in the chain is crucial to 
the final topology and shape of the protein. 
Geometry thus conspires with attractive and 
repulsive forces to fold the amino-acid chain 
into the complex shape that gives a protein its 
particular function. 

Simpler examples of self-assembly also 
abound. Soap molecules, or ‘surfactants’ are one. 
These consist of a water-loving (hydrophilic) 
head connected to a water-hating (hydrophobic) 
tail. When placed in water, the hydrophobic 
tails of different surfactant molecules bunch 
together to form the core ofa sphere, with their 
hydrophilic heads at the surface. This way, both 
heads and tails get to live in their desired sur- 
roundings. While arranging themselves, the 
tails often corral a particle of dirt, isolating it so 
that it can be washed away. 
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Onoe and colleagues’ exploit the same ideas 
of self-assembly, except that their building- 
blocks are not organic molecules, but particles 
of silicon formed into slightly tapered cylinders 
10 um in diameter at the wider end. This wider 
base is coated with a layer of hydrophobic mol- 
ecules, whereas a thin layer of silicon dioxide 
(SiO,) forms naturally through oxidation of 
the remaining surfaces in air. When placed in 
water with a nearly neutral pH of 6.5, the SiO, 
surfaces become negatively charged and repel 
each other when two particles come close. 
The hydrophobic bases of two cylinders, by 
contrast, can avoid contact with the water by 
pairing up flush against each other. And this 
is exactly what happens when the water is 
stirred to bring particles close to one another: 
the hydrophobic surfaces pair up to form 
barrel-shaped ‘dimers’ (Fig. 1a). 

The authors then add a small drop of acid to 
the water. At the resulting lower pH of 2.0, the 
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SiO, surfaces lose their charge, and the barrel- 
shaped dimers no longer repel each other. In 
fact, when close together they experience an 
attractive van der Waals force, which is strong- 
est between the exposed flat tops of the dimers, 
which provide the largest area of contact. (The 
authors also roughen the round sides of the 
dimers to diminish the attractive force between 
the already small contact area of the curved 
surfaces.) So, when the acid solution is gently 
agitated, the barrel-shaped dimers undergo a 
further stage of self-assembly into long, thin 
cylinders. 

The next challenge, making a chain of linked 
rings, takes the process a significant step fur- 
ther. Starting this time with particles shaped 
like a wide squat U, with a square hydrophobic 
patch on the inside of the base, Onoe et al. fol- 
low the same procedure as before (Fig. 1b). 
The particles first link up at the hydrophobic 
patches, and then, when the pH is lowered, 
connect at their ends by van der Waals attrac- 
tion to form rings. Only a couple of links are 
demonstrated, but the ability to form such 
topologically complex objects is intriguing. 

This work follows on from the pioneering 
experiments of George Whitesides and his 
colleagues, who exploited hydrophobic and 
hydrophilic interactions, as well as capillary 
action, to build complex structures on scales 
of 10 um and larger: rings, tetrahedraand tiny 
rugby balls’; chiral helices’; and stacked plates 
and three-dimensional crystals’. Here too, the 
geometry of the building-blocks largely deter- 
mines the objects that are assembled. Particles 
can even be shaped to fit together in a lock- 
and-key manner’. More recently, this mecha- 
nism has been exploited on length scales of a 
micrometre to build up an ‘alphabet soup’ of 
particle shapes’. These smaller particles are 
batted about naturally by thermal brownian 
motion, self-assembling with no external inter- 
vention, such as stirring, at all. 

What is especially interesting about Onoe and 


Figure 1| Self assembly in action. Onoe and colleagues’ processes seen under the microscope, 
starting from (a) tapered cylinders and (b) U-shaped particles. (Images reproduced, with permission, 


from ref. 1.) 
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colleagues’ technique’ is the sequential control 
over a self-assembly process that involves differ- 
ent interactions — first hydrophobic forces, then 
electrostatic repulsion and finally van der Waals 
attraction. Sequential self-assembly has been 
carried out before, most notably in beautiful 
experiments by Ned Seeman® and Chad Mirkin’ 
and their colleagues using artificial sequences of 
DNA. But those experiments use the coding of 
DNA base pairs rather than different forces to 
control the sequence of self-assembly. Moreover, 
they are ‘biokleptic’ (Seeman’s term): that is, they 
borrow heavily from biological processes. Onoe 
and colleagues’ approach is more general and, 
at the current stage of development, much less 
powerful. It does, however, represent a useful 
addition to our toolbox. 

To develop self-assembly into a practical 
technology, we will need to be even more inge- 
nious than nature, exploiting all the interactions 


at our disposal and creating yet-to-be envis- 
aged pathways. The recent work' takes usa step 
further down that road. r 
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EVOLUTIONARY BIOLOGY 


Structure in mutualistic networks 


Susanne S. Renner 


Statistical analyses of the networks formed by plant-animal mutualisms 
can now take account of the relatedness of the players on either side. How 
helpful is this innovation for understanding network dynamics? 


The mutually beneficial relationships between 
plants and animals take several forms. One 
example is pollination. Another is the process 
by which a fruit-eating creature, a frugivore, 
gets a meal and subsequently disperses a plant's 
seeds in its droppings. 

In the context of a local ecological commu- 
nity, such relationships can be seen as networks 
in which the evolutionary dynamics of the part- 
ners may be mutually dependent, contributing 
to an array of coevolutionary processes’. 

On page 925 of this issue, Rezende et al.’ 
report an analysis of plant-pollinator and 
plant-frugivore networks that includes infor- 
mation on the evolutionary history of the part- 
ners — that is, on the phylogenetic relatedness 
of each partner to other plants or animals in the 
network. Their aim was to determine whether 
relatedness affects network structure and 
whether it predicts ‘cascades of coextinction. 
This is the first such evolutionary network anal- 
ysis, and it highlights both the power and the 
limitations of the approach. Two central con- 
cepts are those of species ‘strength’ and species 
‘degree’ (as described in Box 1, overleaf, which 
gives the background to the method). Combin- 
ing the two concepts has produced a boom in 
the analysis of plant-animal networks™*. 

In mutualistic networks, optional interactions 
occurring among many species are common; 
most interactions are strongly asymmetric; and 
species interact with nested subsets of partners’**. 
Thus, a large number of species having few inter- 


actions coexists with a relatively small number 
of super-generalists. Relatively rare plants and 
animals, as well as those with comparatively few 
partners, interact primarily with a core group of 
abundant generalist species. 

Related participants in mutualisms are likely 
to have similar morphology, physiology and 
behaviour. These are traits that evolve as line- 
ages diversify, so we would expect the structure 
of mutualistic networks to be influenced by the 
hierarchical phylogenetic relationships present 
ina particular community. This is the assump- 
tion Rezende et al.’ set out to test. Statistical 
methods to estimate the role of phylogenies 
in explaining patterns of trophic (feeding) 
association are now available”; these meth- 
ods are based on established statistics for the 
phylogenetic comparative method*”. The basic 
approach is to structure the problem of pollina- 
tor—flower associations as a statistical model in 
which phylogenies are used to give the covari- 
ance structure of the ‘error’ terms. 

Rezende et al. applied these methods to 59 
plant-pollinator and plant-frugivore networks, 
which were compiled from the literature. Their 
approach involved obtaining phylogenies for 
the insects in 35 networks, birds (all frugivores) 
in 18, and flowering plants in 52. The number 
of interactions per species was significantly 
phylogenetically conservative in 25% of the 
phylogenies (26 of 105) and a third of the net- 
works. Small phylogenies provide little power to 
detect phylogenetic effects, and when only the 
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22 phylogenies with more than 70 species were 
considered, this percentage increased to 55%. 
A correlation between phylogenetic relatedness 
and ecological similarity — the fraction of com- 
mon interactors between two species — was 
detectable in 43% (50 of 103) of phylogenies. 

For their ten largest communities (those with 
more than 40 animal and plant mutualists), 
Rezende et al. simulated coextinction cascades 
by removing pollinators systematically, start- 
ing with the least-linked (most specialized) 
partners and moving to the most-linked (most 
generalized). Species left without any local 
interaction were assumed to become extinct. 
The simulations show that related species do 
tend to become extinct together, although the 
effects were small. 

Overall, given the modest percentage cor- 
relations cited above, Rezende and colleagues 
find that phylogenetic relationships do not have 
a marked effect on the degree and strength of 
interactions in local communities. This result 
will disappoint those who expect to find such 
a signal in every nook and cranny. One expla- 
nation may lie in the hugely varied temporal 
scales over which the hundreds of plant and 
animal species in the 59 local networks have 
been interacting. The networks encompass 
Mediterranean, tropical, temperate, subtropical 
and Arctic communities, which differ greatly 
in their stability and numbers of evolution- 
arily old or recent species. Also, some of the 
communities have comparatively few closely 
related species, and so lack a strong phyloge- 
netic structure. Given the different temporal 
and spatial scales over which different pol- 
linators (birds, bees, flies, beetles) and plants 
(tropical or temperate, woody or herbaceous) 
evolve, very large networks may be needed to 
discern phylogenetic signals in interaction 
strength and degree”. 

However, even then, phylogeny might not 
predict numbers and kinds of mutualistic 
interactions. As has been shown" for a network 
involving many insects and one focal plant, 
broad-leaved lavender (Lavandula latifolia), 
having few or many visitors to a flower may not 
bea trait that is invariant at the species level, 
but instead may depend on research design 
(sampling effort) and biological phenomena 
(variation in absolute and relative pollinator 
abundance or visitation rates). If a large pro- 
portion of the interactions counted in pollina- 
tion networks are not species-level traits, this 
would explain the absence of clear phylogenetic 
effects in insect-flower networks. 

There is a risk that treating mutualistic net- 
works as “coevolved structures rather than 
as diffuse multispecific interactions”* could 
lead research on networks into a trap from 
which community ecology has long escaped”. 
Instead of revealing coevolved interactions, 
Rezende and colleagues’ results might be taken 
as showing that such interactions are not very 
important. And in terms of extinctions, the 
formation of associations between migrating 
or invading species and local generalist species 
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Box 1| Methods for quantifying mutualisms 


Quantitative approaches to 
understanding plant-animal 
mutualisms go back to the 
application of food-web 
theory to a large sample of 
pollination and seed-dispersal 
networks, Mutualistic 
interactions can be arranged 
as a matrix of dependences 
between a =1, 2,...,ispecies of 
animal and p = 1, 2,...,7 species 
of plant, with the total number 
of species in the system, M, 
equal toa +p. 

One can then examine the 
proportion of all possible 
interactions that actually 
occur and, for interacting 
pairs of species, estimate the 


relative magnitude of the 
interaction, or the number 
of interactions per species 
(referred to as species 
‘degree’). The realized number 
of interactions (connectance) 
scales in proportion to M; 
the potential number of 
interactions scales as a x p; 
the average number of 
interactions per species varies 
independently of M (ref. 13). 
Because connectance covaries 
with species richness (as in 
other food webs), network 
studies control for M by 
doing regression analysis on 
residuals. 

Recently, two further 


parameters have been 
introduced, quantifying 
mutual dependence or 
species ‘strength’. The first is 
the dependence of plant 
species j on animal species i 
— that is, the fraction of all 
visits from a particular animal 
species. The second is the 
dependence of animal species 
ion plant species j (that is, the 
fraction of all visits by this 
animal species going toa 
particular plant species). This 
allows an index of asymmetry 
to be calculated for each 
pairwise interaction, paving 
the way for the latest 


analyses”. S.S.R. 


might actually encourage the persistence of 
local species in the long term”*"”. 
Nonetheless, with habitat fragmentation and 
climate change now occurring so rapidly, mutu- 
alistic networks are likely to be severely affected 
in many places. Setting networks in an evolu- 
tionary context might help to predict their level 
of resilience. Using the approach developed by 
Rezende et al., future work on networks might 
incorporate indirect antagonistic interactions 
affecting mutualistic networks — such as those 
in which herbivores consume flowers or fruits 
to the detriment of the plants concerned. 
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MATERIALS SCIENCE 


Embedded shells decalcified 


Catherine Picart and Dennis E. Discher 


Synthetic microcapsules with membrane-bound inner chambers in which 
chemical reactions can be isolated and controlled have been assembled, 
layer by layer. Could artificial cells be on the horizon? 


The next time you eat an apple, take a moment 
to consider its hierarchical structure. The 
thin, durable skin wraps around the delecta- 
ble fruit, in which can be found seeds; those 
seeds, in turn, have a coating of their own to 
encase their contents. Animal cells possess a 
similar arrangement of embedded compart- 
ments: cell membranes confine the machinery 
of life, including the nuclei with their mem- 
brane-ensconced genomes. Both cell and 
nuclear membranes self-assemble from lipid 
molecules, and are fortified with networks of 
proteins whose structures are templated by 
interactions with the membranes. The hierar- 
chical assembly of a synthetic, cell-like struc- 


ture has now been described in Angewandte 
Chemie by Kreft et al.'. They have used micro- 
spheres of calcium carbonate as templates to 
construct polymeric, shell-within-shell cap- 
sules. Once the structures are assembled, the 
templates are simply washed away. 

The authors adopted a strategy known as 
layer-by-layer (LbL) assembly, which entails 
dip-coating a template alternately in two 
polymer solutions, one of which is positively 
charged and the other negatively charged. A 
wide range of polymers can be deposited in 
this way, to make multilayered structures with 
an array of physical and chemical properties, 
and many potential applications’. In addition, 
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these surface-grown films can coat a broad 
range of substrates, from large flat surfaces 
to highly curved colloidal particles. Flexible 
polymers can be layered onto complex, curved 
geometries — for example, red blood cells, 
which have a concave structure, have been 
used as templates for LbL assembly’. 

Kreft et al.' used microspheres of ionic cal- 
cium carbonate as the template for the inner 
capsule of their structures. These spheres offer 
two advantages as templates for LbL assembly. 
First, they can be dissolved at the end of the 
process by washing with a solution of EDTA 
(a molecule that traps calcium ions); this is a 
milder way of removing templates than previ- 
ously reported methods, which involved harsh 
organic solvents. Second, calcium carbonate 
can be used as a matrix to trap biomolecules, 
including proteins and DNA‘ as the processes 
involved are mild and ‘biofriendly’ 

The authors began by forming microspheres 
of calcium carbonate, using them as a matrix 
to immobilize protein molecules (Fig. 1, over- 
leaf). They also trapped magnetic nanoparti- 
cles in the spheres to provide an easy method 
of separating intermediates from unwanted 
co-products during the assembly process. The 
authors coated the spheres with ten alternating 
layers of synthetic polymers, and then grewa 
thick shell of calcium carbonate on the surface 
of the resulting polymer film; a second kind of 
protein was incorporated into this layer. The 
authors deposited more layers of polymers 
onto the outer shell, and finally removed the 
calcium carbonate by washing with EDTA. 
This yielded micrometre-sized, shell-within- 
shell microcapsules, filled with proteins; like a 
cell, the proteins in the synthetic ‘nucleus’ were 
different from those in the ‘cytoplasnv. 

Both cell and nuclear membranes are semi- 
permeable — they permit some molecules to 
pass, but not others, which allows many essen- 
tial reactions to be confined to either the nucleus 
or the cytoplasm. Although small molecules 
diffuse through synthetic membranes, many 
proteins and nanoparticles are retained within 
LbL capsules, so LbL shells can also selectively 
control which molecules pass through them. 

Kreft and colleagues’ used this effect to 
compartmentalize an enzyme reaction. They 
prepared a shell-within-a-shell system in 
which enzymes in the outer chamber oxidize 
water molecules to form hydrogen peroxide. 
The peroxide readily diffuses into the inner 
chamber, where it oxidizes an encapsulated dye; 
this produces a red colour that slowly diffuses 
back into the outer compartment. This proof- 
of-principle reaction will no doubt spur inter- 
est in other embedded reactions, and perhaps 
stimulate new designs for synthetic cells. 

Compartmentalized structures that mimic 
cells have been generated using other mild 
processes. One interesting approach is to trap 
a solution of two polymers within a lipid vesi- 
cle’; if the polymers and conditions are care- 
fully chosen, a central pool forms in which 
only one of the polymers is dissolved; this is 
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Figure 1 | Polymer shells within shells. Kreft et al.‘ have prepared polymeric shell-within-shell 
capsules. a, The authors prepared calcium-carbonate microspheres in which magnetic nanoparticles 
and a protein were embedded. b, The spheres acted as templates for a polymer film. c, The authors 
then deposited a thick shell of calcium carbonate in which a different protein was embedded, and 
(d) separated the desired material from co-products using a magnet. e, A second polymer layer was 
deposited on the outer sphere. f, Finally, the authors extracted the calcium carbonate to leave behind 
a polymeric shell-within-shell structure, with different proteins trapped in separate compartments. 


(Figure adapted with permission from ref. 1.) 


surrounded by a separate solution of the other 
polymer. Such a system lacks a semi-perme- 
able membrane to separate the two solutions, 
but has the advantage that the whole thing can 
easily be triggered to break down using vari- 
ous stimuli, such as heating; this is potentially 
useful for applications in which the capsule is 
being used as a delivery system. In comparison, 
Kreft and colleagues’ approach yields physically 
robust shells, but these could, in principle, be 
made with degradable polymers. 

Kreft et al. comment’ that one of the long- 
term goals of their work is to produce drug- 
delivery systems that can release drugs ina 


controlled manner, or in response to a biological 
trigger. For this application, biodegradable mul- 
tilayered materials are especially useful. Interest- 
ingly, several such polymeric materials are about 
as compliant’ as cell and nuclear membranes’ 
— that is, their tendency to revert back to their 
original shape upon the removal ofa distorting 
force matches that of their biological counter- 
parts. Physical properties such as compliance 
are surprisingly important in determining the 
compatibility of materials with biological sys- 
tems. For example, it is now known that flex- 
ible objects are less prone to being engulfed by 
phagocytic cells, which clear foreign particles 


from the bloodstream and often prevent drug- 
delivery systems from working’. The ability to 
eliminate rigidity from drug-delivery systems 
makes Kreft and colleagues’ decalcification 
approach’ even more attractive. 

A grander challenge for LbL assembly is to 
prepare functional mimics of nucleated cells. 
Perhaps the ingredients for transcribing DNA 
into messenger RNA could be trapped in the 
synthetic nucleus of a cell mimic; the RNA 
could then be translated into proteins in the 
surrounding ‘cytoplasm. This is undoubtedly 
ambitious, but such systems might help answer 
a fundamental question of biology: why do 
cells store DNA in a nucleus? Regardless of the 
directions pursued, Kreft and colleagues’ mild 
method for the LbL assembly of nucleated sys- 
tems and compartmentalized reactions is likely 
to stimulate the work of materials scientists for 
years to come. a 
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EARTH SCIENCE 


Old diamonds and the upper crust 


lan S. Williams 


Was the early Earth a blackened landscape of congealed lava, or was it cool 
enough for oceans to form? The discovery of diamonds in the oldest-known 
relics of surface rocks adds new élan to this debate. 


The earliest era of Earth’s history — between 
the creation of the planet about 4.5 billion years 
ago and the formation of the oldest known 
rocks 500 million years later — is the geological 
equivalent of the Dark Ages. The formal name 
given to the period, the Hadean, matches the 
long-held view that during this time Earth’s 
entire surface, or at least large tracts of it, was 
a hellish, seething mass of molten lava. In con- 
trast to that picture, recent studies of the old- 
est known relics of surface rocks, tiny and rare 
crystals of the trace mineral zircon (ZrSiO,), 
indicate that it took less than 200 million years 
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for Earth’s surface to cool sufficiently for water 
to condense. Menneken and colleagues’ discov- 
ery’ (page 917 of this issue) that some of those 
ancient zircon crystals contain tiny diamonds 
turns the field on its head once more. The 
results lend succour to the view that the initial 
hot phase was long-lived, and will invigorate 
questioning of the cool-Earth concept. 

As Earth grew in the dust cloud around the 
proto-Sun, probably colliding with a planet 
the size of Mars during the process, surface 
temperatures would have reached in excess of 
6,000 °C (ref. 2). When the surface eventually 
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cooled, solid rocks started to form, but what 
those rocks were like, and when the formation 
process started, are controversial questions. The 
problem is one of preservation. Intense meteor- 
ite bombardment destroyed much of the early 
crustal rock, and with it evidence of Earth’s early 
environment. In addition, Earth is an active 
planet on which crustal rocks are continuously 
destroyed by weathering and erosion, and recy- 
cled deep into the interior through plate tecton- 
ics. These processes have been going on for at 
least 3.8 billion years, and so very few crustal 
rocks older than 3.7 billion years — and none 
older than 4.03 billion years — are known. 

The only surviving relics of Earth’s crust 
from those first 500 million years of the planet's 
history are zircon crystals, mostly recovered 
from slightly less ancient sedimentary rocks’. 
Zircon is a tough mineral: it is both chemi- 
cally and physically inert under a wide range 
of geological conditions. As a result, zircon 
grains formed in old igneous and metamor- 
phic rocks are commonly recycled unchanged 
into younger rocks. 
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Hadean zircon grains are small (less than 
0.3 mm in diameter) and rare: so far only two 
rocks, both from Western Australia, have been 
found with an ‘abundance’ of the grains (which 
means about | part per million by weight). 
Fortunately for those studying the early Earth, 
these tiny grains carry a remarkable amount 
of information about the rocks in which they 
crystallized. That information can be extracted 
through a range of modern techniques that 
allow growth zones in individual zircon grains 
to be analysed on a scale of a few micrometres. 

For example, uranium, thorium and lead iso- 
topes provide an accurate measure ofa grain’s 
age* ; zoning textures’ and patterns of rare-earth 
elements’ distinguish igneous from metamor- 
phic growth; and microinclusions of other min- 
erals show the mineralogy of the original host 
rock’. The more titanium a zircon grain con- 
tains, the higher its crystallization temperature 
was’; and the greater the grain’s enrichment in 
the oxygen isotope '%O, the more the magma 
from which the zircon crystallized interacted 
with liquid surface water’. The hafnium iso- 
topic ratio 'Hf/'”’Hf indicates whether that 
magma was purely mantle-derived (high initial 
value) or contained a crustal component (low 
initial value)*”. Finally, xenon isotopes, among 
them the fission products of the extinct pluto- 
nium isotope *““Pu, provide some of the most 
direct evidence available that Earth formed 
with a plutonium-uranium ratio similar to 
that of the Sun”®. 

Some of the conclusions reached on this 
basis have been challenged, principally on the 
grounds that some of the compositions meas- 
ured do notaccurately reflect the temperature or 
the primary chemical and isotopic compositions 
of the magmas from which the zircons crystal- 
lized. But a majority view prevails. Hadean 
zircons are mostly of igneous origin, and crys- 
tallized in a series of events that occurred mainly 
between 4.25 and 4.00 billion years ago; crys- 
tallization temperatures averaged only about 
680 °C (ref. 8); the host magmas had composi- 
tions similar to granites, and were produced by 
the melting of mixtures of crustal and mantle 
rocks; and the crustal rocks had interacted with 
liquid water at or near Earth’s surface. Allin all, 


Figure 1| Treasure 
trove. Menneken 
and colleagues’ 
found diamonds in 
zircons, the earliest 
known remnants 
of Earth’s crustal 
rocks that occur in 
conglomerates at 
Jack Hills in Western 
Australia. 


this is convincing evidence that there were at 
least some periods during Earth’s early history 
when its surface was cool. 

The discovery by Menneken et al.' of micro- 
diamond inclusions in about 1 in 20 zircons, 
ranging in age from 4.3 to 3.1 billion years, that 
were randomly selected from the original Jack 
Hills conglomerate site in Western Australia’ 
(Fig. 1), will necessitate a careful reassessment 
of these conclusions. Diamond isa mineral that 
forms only at high pressures, whether these 
are transient (during, for example, a meteor- 
ite impact) or sustained (through deep burial, 
for instance). It is therefore not found in low- 
temperature granite magmas such as those in 
which the Hadean zircons are inferred to have 
crystallized. 

Assuming that the zircon did crystallize 
at temperatures of about 680 °C, pressures 
exceeding 3.5 gigapascals would have been 
needed for diamond to be stable in the magma 
source. This pressure is equivalent to burial 
more than 100 kilometres below Earth’s sur- 
face. But the Jack Hills zircons show no other 
evidence of such high pressures — whether 
transient in the form of shock features, or 
sustained in the form of relics of other high- 
pressure minerals, metamorphic overgrowths 
or rare-earth element compositions indicative 
of high-pressure growth. If the evidence of the 
diamonds is correct and the Hadean zircons 
did crystallize from magmas at high pressure, 
then those magmas could not have been crus- 
tal melts. This would undermine other infer- 
ences based on the assumption that the zircons 
preserve their original chemical and isotopic 
compositions — including the deduction that 
the early Earth was cool. 

Menneken et al.’ present two possible expla- 
nations for the presence of diamonds in zircons 
of such a wide age range. Either the diamonds 
formed during a single high-pressure event 
about 4.3 billion years ago and were then recy- 
cled into younger rocks, or they were incorpo- 
rated into the Hadean zircons by an undefined 
process that was repeated several times during 
the first billion or so years of Earth’s history. Nei- 
ther of these explanations is entirely convincing, 
and the implications for surface temperatures on 
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the early Earth are not discussed. 

A possible explanation that does not contra- 
dict the evidence for a cool early Earth is that 
the diamond was introduced to the zircon not 
as diamond, but as graphite precipitated into 
fractures and other imperfections in the zir- 
con grains. De Corte et al."', for example, have 
studied microdiamond inclusions in zircon 
from rocks subjected to ultra-high pressure in 
the Kokchetav Massif in northern Kazakhstan. 
They concluded that the original carbon that the 
diamonds were formed from was precipitated 
through the reaction of carbon dioxide with a 
reduced carbon phase such as methane. 

One can conceive of a process in which a 
segment of early crust containing zircon from 
low-pressure igneous rocks of different ages 
became impregnated with carbon-rich fluids, 
possibly produced by reactions involving an 
atmosphere enriched in carbon monoxide, car- 
bon dioxide and methane”. Rapid deep burial 
of those rocks by subduction (horizontal tec- 
tonics), sagduction (vertical tectonics) or man- 
tle overturn” could then have formed diamond 
in zircon of various ages during a single event, 
possibly without changing the zircons’ chemi- 
cal and isotopic compositions. During subse- 
quent uplift, metastable diamond trapped in 
zircon grains could have been preserved, with 
some partly reverting to graphite — as indeed 
is observed’, 

Whether the diamond was indeed produced 
by such a relatively rapid process, or was of 
long-lived mantle origin, could be tested by 
examining its carbon isotopic composition 
and whether nitrogen is present as single 
nitrogen atoms (indicating little time spent in 
the mantle at relatively low temperatures) or 
nitrogen pairs and tetrahedra (long residence 
time at relatively high temperatures)’. Such 
tests will be needed to further the lively debate 
on whether the early Earth was hot, or not. & 
lan S. Williams is in the Research School of Earth 
Sciences, The Australian National University, 
Canberra, ACT, 0200, Australia. 
e-mail: ian.williams@anu.edu.au 
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Daniel Koshland (1920-2007) 


Biochemist, institution-builder and editor of Science. 


Ina research career that lasted more than 
60 years from 1945 until his death on 23 July, 
Dan Koshland used his skills as a chemist 
and highly creative thinker to advance our 
understanding of proteins and cells. An 
energetic and caring man, passionate about 
all aspects of science, his persuasive and 
insightful leadership greatly strengthened 
every institution of which he was part. 

Koshland decided to pursue science 
after reading, as a young teenager, Paul de 
Kruif’s collection of biographical essays, 
Microbe Hunters, and Arrowsmith, Sinclair 
Lewis's novel of an idealistic scientist and 
doctor. After receiving a bachelor’s degree in 
chemistry from the University of California, 
Berkeley, in 1941, Koshland was recruited to 
the Manhattan Project, and became a group 
leader under Glenn Seaborg in the team at 
the University of Chicago that successfully 
purified plutonium. When the war ended, 
he entered graduate school in Chicago 
under another giant of chemistry, Frank 
Westheimer, then a young assistant professor. 
Koshland attributed much of his success to the 
mentoring and inspiration of these two men. 

After a brief stint at Harvard, Koshland 
moved in 1951 to Brookhaven National 
Laboratory on Long Island, New York, 
where he focused on understanding 
through physical organic chemistry how 
enzymes catalyse biological reactions. He 
became fascinated by the puzzle of enzyme 
specificity: how does an enzyme select only 
one of many closely related molecules as its 
substrate? 

At the time, the experts believed that Emil 
Fischer's ‘keylock or ‘template’ theory, 
proposed in the late nineteenth century, had 
solved the problem. But Koshland realized that 
this model of a rigid enzyme could not explain 
why an enzyme catalyst is much less effective 
when a distant part of the substrate molecule 
is removed. The correct answer, published 
by Koshland in 1958 only after numerous 
rejections, is that enzymes are activated by 
changing their shape when they bind their 
substrates, and that process is influenced 
by the distant part of the substrate. This 
principle, called ‘induced fit, was eventually 
confirmed fully through detailed structures 
of enzyme-substrate complexes produced by 
X-ray crystallography. Induced shape change 
has proved to be a ubiquitous phenomenon, 
applying not only to substrate recognition, 
but also to much of the regulation of enzyme 
activity in biological systems. 

In 1965, Koshland moved back to Berkeley, 
to the Department of Biochemistry. A 
year later, he published a classic paper, 
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‘Comparison of experimental binding data 
and theoretical models in proteins containing 
subunits, now recognized to provide the 
most general model for cooperative (and 
anti-cooperative) responses in proteins. In 
the 1970s, he and his lab shifted to focus 

on understanding how cells receive and 
interpret signals, wisely selecting bacterial 
chemotaxis — the movement of bacteria in 
response to changes in the concentrations of 
certain chemicals — asa system that could 
be explored in exquisite detail. By rapidly 
changing the concentrations of the molecules 
that bacteria detect, Koshland demonstrated 
that bacterial cells have a ‘memory’ of their 
environment. In later years, members of his 
lab worked out much of the biochemistry 

of the receptor-protein methylation that 
underlies this process. 

In 1976, Koshland showed that differences 
between bacterial daughter cells can be 
caused by stochastic fluctuations inherent 
in systems driven by small numbers of 
molecules. How biology deals with such 
fluctuations is now a focus of research in 
many laboratories. In 1984, together with 
Albert Goldbeter, Koshland introduced the 
concept of ‘ultrasensitivity, revealing how 
large differences can originate from a cascade 
of enzymatic reactions downstream from 
an initial signalling event. All this work has 
changed the way we think about biology; 
Koshland’s mentorship has spawned many 
leaders in these fields. 

While maintaining an active research 
laboratory, Koshland became chair of the 
Berkeley biochemistry department in 1973 
and began a series of institution-building 
activities across the university. In the 1980s, 
he drove a revolutionary reorganization 
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of Berkeley’s 200-strong biology faculty 
from more than ten different small 
departments into three large ones. This was 
a seemingly impossible task to those versed 
in university politics, but its beneficial effect 
on the quality of both faculty and student 
recruitments was immediately apparent. 
The recent development of biology-focused 
interdisciplinary science and engineering 
efforts at Berkeley also owes much to 
Koshland. 

In 1985, Koshland became editor-in-chief 
of Science, a pivotal position for world science 
that he occupied, as a half-time post, for ten 
years. He transformed the journal, not only 
vastly improving the scientific research it 
published by appointing a cadre of talented 
full-time editors with PhD degrees, but also 
greatly increasing the quality and quantity 
of the sections dealing with science news 
and science policy. He wrote more than 200 
editorials, many in the form of dialogues 
with ‘Dr Noitall’ that were testaments to 
both his originality and his remarkable wit. 
These duties did not dent his formidable 
scientific productivity: during his editorship, 
he authored more than 100 scientific articles 
stemming from work at his beloved Berkeley 
laboratory. 

Dan Koshland was both a model scientist 
and a model human being. He exemplified 
all the values that the idealists among us 
associate with science. He was completely 
open and honest, with a generous, happy 
spirit that made him a pleasure to be around. 
He derived great pleasure from being helpful 
to everyone, and he went out of his way to 
support young scientists. I was amazed to 
discover late in his life that he, a modest man 
who always flew in economy class and drove 
a dented car, was very wealthy, heir to part 
of the Levi Strauss fortune. In fact, Dan was 
a great philanthropist for science, helping 
to finance new buildings at Berkeley and 
at Haverford College in Pennsylvania, and 
both designing and endowing a new science 
museum at the National Academy of Sciences 
in Washington DC. I had the privilege of 
working closely with him in designing that 
museum. It is named after his first wife, 
Marian (Bunny) Koshland, an outstanding 
scientist in her own right, to whom he was 
married for 52 years until her death in 1997. 

Dan is survived by five children from 
that first marriage, and by his second wife, 
Yvonne, a former college friend whom 
he married in 2000. Family, friends and 
colleagues will come together to celebrate the 
breadth of his contributions to science in a 
memorial service and symposium at Berkeley 
on 16 September. 

Bruce Alberts 

Bruce Alberts is in the Department of 
Biochemistry and Biophysics, University of 
California, San Francisco, 600 16th Street, 
San Francisco, California 94158, USA. 
e-mail: balberts@ucsf.edu 
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Integrating molecular and network biology 
to decode endocytosis 


Eva M. Schmid! & Harvey T. McMahon! 


The strength of network biology lies in its ability to derive cell biological information without a priori mechanistic or 
molecular knowledge. It is shown here how a careful understanding of a given biological pathway can refine an interactome 
approach. This permits the elucidation of additional design principles and of spatio-temporal dynamics behind pathways, 


and aids in experimental design and interpretation. 


lathrin-mediated endocytosis (CME) is an important ves- 

icle biogenesis pathway’*. Cargo is packaged with high 

fidelity into vesicles of defined size that are surrounded 

by a coat predominantly made of the protein clathrin and 
adaptor protein complexes. This coat is visible by electron micro- 
scopy and intermediate stages of vesicle formation have been docu- 
mented (Fig. la)’. An extensive protein machinery works alongside 
the scaffolding clathrin and adaptor protein complexes to build a 
vesicle (Fig. 1b and Supplementary Tables). 

The CME pathway was quickly recognized to be important for 
nutrient and growth factor entry into cells, and, despite its cargo spe- 
cificity and regulation, some toxins and viruses can still hijack it to gain 
entry into cells'*. At the synapse, clathrin-coated vesicles (CCVs) par- 
ticipate in retrieval of synaptic vesicles following exocytosis, in which 
potentially multiple copies of more than 10 different transmembrane 
proteins must be packaged, in appropriate stoichiometries, into vesi- 
cles of uniform size to maintain synaptic transmission’. 

We define CME as a ‘pathway’ because between the input (cargo) 
and the output (cargo-containing vesicle) there is an organized set of 
intermediate sub-reactions. In this analysis when we refer to a path- 
way we mean a macroscopic series of discrete events with a functional 
outcome and we do not mean higher order functions like cell division 
or vesicle trafficking, which are composed of orchestrated interac- 
tions of one or more pathways. 


The CME interactome 


At the morphological level, CME is seen as the pathway of cargo 
recruitment into patches of membrane, which become progressively 
invaginated to form CCVs. These vesicles are subsequently detached 
from the parent membrane and then uncoated (Fig. la). On closer 
inspection, this linear depiction is an oversimplification, because at 
the molecular level there are many micro-reactions occurring simul- 
taneously in this pathway. For example, cargo recruitment, mem- 
brane bending and clathrin-polymerization all happen in parallel in 
the early stages of vesicle formation’. 

Appreciating the considerable variety of proteins known to be 
involved in CME, we set out to learn about the pathway from a 
network approach. Using biochemical, structural and proteomic 
data®’, the major protein interactions for CME can be organized 
as a pathway protein interactome (Fig. 1b)*’° giving us the ‘CME- 
interactome’. This may not be the complete description of all possible 
interactors but, given the intensity of work on CME, it is likely that a 
very large percentage of protein-protein interactions are already 
described and therefore presented in this interactome. 


The basic elements of the interactome (individual proteins and 
lipids, and so on) are called nodes and their interactions are depicted 
as links connecting the nodes'*-"’. Stable protein complexes, like the 
adaptor protein complex AP2 or a clathrin triskelion, are considered 
as single nodes (a position that has support in the literature'®). To 
emphasize the core machinery of endocytosis we peeled away inter- 
actions that point outwards from the pathway to distinct processes 
like exocytosis and actin polymerization (Fig. 1b, grey-shaded areas). 
Thereby we distinguish our pathway from other network modules’” 
that can be connected pathways (for example, exocytosis) or modular 
attachments (‘plug-ins’). Such attachments can be used in many 
diverse pathways (for example, actin polymerization associated with 
vesicle scission and motility'*°). Also, for simplicity, in our CME 
interactome we concentrated on protein nodes. Changes in second 
messengers like calcium, in phosphorylation levels, and in membrane 
lipid composition could also be described as nodes in the pathway 
and changes to these components would have wide implications for 
this and other trafficking pathways. Disregarding the overall regu- 
lation is crucial for identifying the core machinery. These plots delib- 
erately underestimate the ways in which pathways are directly or 
indirectly sensitive to other pathways in the cell when embedded in 
a larger network. 

The first thing that becomes apparent within this newly generated 
pathway-network is that clathrin and AP2 have disproportionately 
more interactions than other proteins in this pathway and so fall into 
the definition of being ‘hubs’'*”’ (see Fig. 1b). 

Hubs have been subdivided into ‘party’ and ‘date’ subtypes”’. Date 
hubs are biomolecules that have many partners but interact with 
them at different times or locations (dynamic hubs), whereas party 
hubs can interact with many biomolecules at once (static hubs). Date 
hubs are proposed to organize the proteome, connecting biological 
processes to each other, whereas party hubs are proposed to function 
within modules (which may be pathways). Although recognizing the 
descriptive benefits of ‘party hub’ and ‘date hub’ nomenclature we 
call clathrin and AP2 ‘pathway hubs’ because this does not immedi- 
ately ascribe any behaviour to these hubs, but simply states that they 
appear within the studied pathway. 


CME pathway hubs 


The AP2 hub belongs to the family encoding heterotetrameric 
adaptor protein complexes. Such adaptor protein complexes have a 
trunk domain that binds to cargo and lipids, and two appendage 
domains, positioned on flexible linkers that bind to accessory pro- 
teins via two distinct binding sites on each appendage (Fig. 2a)”*”?*. 


'MRC Laboratory of Molecular Biology, Hills Road, Cambridge CB2 OQH, UK. 
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It is these appendage domains that form the protein interaction 
surface for the many accessory proteins (for binding motifs see 
Table 1 and Supplementary Tables), when concentrated in forming 
coated pits'®. Adaptor protein complexes do not self-polymerize and 
so concentration and stabilization of adaptor protein complexes is 
via binding partners (as shown schematically for the binding partner 
Eps15; Fig. 2a). 

Clathrin is the second hub in the network (Fig. 1b). Clathrin binds 
to most of its interaction partners via a B-propeller domain found at 
the end of each leg of the clathrin triskelion (Fig. 2b). During coated 
vesicle formation, clathrin becomes concentrated by interactions 


both with adaptor protein complexes and with accessory proteins®”’. 
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Figure 1| Clathrin-coated vesicle formation: molecular detail versus a 
network view. a, Budding of a CCV. The initial membrane invagination 
occurs in concert with cargo recruitment and clathrin polymerization. 
Accessory proteins, such as AP180/CALM and epsin, are responsible for 
clathrin recruitment and size-constrained polymerization as well as 
membrane bending. The AP2 adaptor protein complex links the plasma 
membrane cargo into the forming clathrin coat. Dynamin is recruited by the 
accessory proteins amphiphysin, sorting nexin 9 and/or intersectin to the 
neck of the vesicle to enact membrane scission on GTP hydrolysis, whereas 
auxilin and the ATPase HSC70 are involved in the uncoating. The stages 
depicted are also evident in electron microscopy pictures taken from the 
work on yolk protein endocytosis in chicken* and reproduced with 
permission of the Company of Biologists. b, The CME interactome: the 
protein interactions occurring during coated vesicle formation. We have 
grouped many proteins with similar functions together, for example some of 
the kinases involved in CME (AAK and CVAK) and alternative cargo 
adaptors (Dab2 and ARH) are illustrated as single nodes. Different kinds of 
cargo molecules are also illustrated as one node, which should not lead to the 
assumption that cargo is a hub. This type of plot shows that AP2 and clathrin 
are the most common protein interaction points in the network, and 
therefore are of importance in the organization of the network, and these are 
surrounded by accessory proteins. The CME interactome is likely to be 
slightly different in each cell type and will be adapted to the ligands to be 
endocytosed and the speed with which this is required. Also, differences 
between species are easy to accommodate within the network, given its 
flexibility. 
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Table 1| Domain structures and functions of CME proteins 
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and  AP2 appendage binding motifs 


100 amino acids X Clathrin binding sites 


Hub proteins are conserved; but where there are multiple isoforms of accessory proteins in 
mammals and fish, one seems to be sufficient in invertebrates, even in those with nervous 
systems. A more comprehensive version of this Table is given in Supplementary Tables 1 and 2. 
Clathrin interaction motifs (X) are incomplete as they are difficult to detect given their sequence 
variations. Question marks indicate uncertainty. When we do not find clathrin or adaptin 
interaction motifs in homologues then we generally assume that the protein is not involved in 
CME. For amphiphysin, we know that a Drosophila form of amphiphysin does not have any 
clathrin or adaptor interaction and does not function in CME and so is not annotated in this Table 
(see asterisk). ANTH, API8O N-terminal homology; AP180, adapter protein 180 kDa; ARH, 
autosomal recessive hypercholesterolaemia; BAR, Bin/amphiphysin/Rvs; CALM, clathrin 
assembly lymphoid myeloid leukaemia protein; C. elegans, Caenorhabditis elegans; Dab2, 
disabled 2; D. melanogaster, Drosophila melanogaster, D. rerio, Danio rerio; EH, Eps15 homology; 
ENTH, epsin N-terminal homology; Eps15/R, epidermal growth factor receptor pathway 
substrate 15/related; P. falciparum, Plasmodium falciparum; PRD, proline rich domain; PTB, 
phosphotyrosine binding; R. norvegicus, Rattus norvegicus; SH3, Src homology 3; S. purpuratus, 
Strongylocentrotus purpuratus; UIM, ubiquitin interacting motif. 


Concentrated clathrin self-polymerizes into a lattice, forming an 
interaction surface (beneath the outer shell of clathrin legs) made 
up of many B-propeller domains (Fig. 2b), allowing the lattice to bind 
to many adaptor protein complexes and accessory proteins simulta- 
neously. This, and clathrin’s interaction with itself, provides the 
required stability to form the budding vesicle and is also likely to 
be important in the concentration of cargo into the coated pit and in 
the binding of proteins involved in uncoating. 


Functional versus connectivity view of CME 


Interactomes give a static picture of pathways that we know to be 
dynamic”. Thus we attempt to show the changes in connectivity 
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associated with the different stages of vesicle formation (Fig. 3). In the 
static interactome (Fig. 1b), we present a network for plasma mem- 
brane CCV formation where AP2 is the major heterotetrameric 
adaptor complex involved in cargo binding”’. We now further focus 
on one form of plasma membrane CCV formation—that of synaptic 
vesicle retrieval in nerve terminals*°—to emphasize the dynamics of 
what can happen in one vesicle, but the dynamics of the network will 
be similar for all types of CCV formation (Fig. 3). 

To form a cargo-containing vesicle, cargo needs to be concentrated 
at patches on the membrane of the appropriate lipid environment. 
This is likely to happen by the clustering of adaptor protein complexes 
bound to phosphatidylinositol-4,5-bisphosphate (PtdIns(4,5)P>) on 
the plasma membrane” (Fig. 3a, functional view), which become 
stabilized by accessory proteins that bind multiple AP2s simulta- 
neously”® and also aid in recruitment of clathrin. Clathrin polymeri- 
zation leads to coat formation. AP2 loses its importance at this stage 
because clathrin now drives vesicle formation (Fig. 3c, functional 
view). As soon as the formation of the coated vesicle is completed, it 
is detached from the parent membrane, and subsequently uncoated 
(Fig. 3d, functional view). 
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Figure 2 | Structural organization of the hubs in CME. a, The major adaptor 
protein complex in brain-derived CCVs is the AP2 complex. This is 
composed of four subunits («, 82, 12, 02). Membrane cargo interacts 
predominantly with the j1-subunit (in red), whereas the appendage domains 
(orange and green), are responsible for binding to accessory proteins (four 
different peptide interactions are highlighted by dotted lines). These 
domains are positioned on long flexible linkers and thus they are perfectly 
designed to be protein recruitment domains bringing the needed partners 
from the cytosol to the forming coated pit. The right panel shows a 
representation of a cargo-containing plasma membrane patch 
accommodating closely packed AP2 complexes, generating a dense 
appendage protein interaction surface. AP2s are stabilized by clustering 
molecules such as Eps15, which have been shown to bind up to four 
appendages simultaneously. b, A clathrin triskelion consisting of three 
clathrin heavy and light chains. Clathrin interacts with ligands via its 
N-terminal B-propeller domain. When clathrin self-polymerizes into a cage 
(right panel), these domains form a dense interaction surface composed of at 
least 180 B-propeller domains per vesicle’. From a structural point of view, 
these propellers are tucked underneath the forming cage (unlike the adaptor 
protein complex appendage domains that point away from the vesicle) and 
are thus more likely to have an organizational role in coated pit formation, 
rather than a protein recruitment role. The clathrin cage on the right is not 
modelled on a 50 nm vesicle but comes from cryo-electron microscopy 
reconstruction of isolated cages”. The protein data bank accession numbers 
for the structures are as follows: «-appendage, 1W80; B-appendage, 2IV8, 
2Iv9; AP2 core, 1GW5; clathrin B-propeller, 1C9L and 1UTC. 
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Figure 3 | The steps of synaptic vesicle CCV formation. Functional and 
connectivity views of CCV formation in nerve terminals. In the network view 
of Fig. 1b, we included all plasma membrane accessory proteins, but here we 
are concerned with the changing interactions during synaptic CCV 
formation (non-synaptic CCV nodes are in light grey). a, AP2 recruitment to 
membranes, by cargo and lipid interactions. b, Building the AP2 hub 
assembly zone: the AP2 pathway hub is stabilized by cross-linking through 
accessory proteins. ¢, Changing hubs: clathrin is recruited and 
polymerization stabilizes the forming vesicle, displacing the AP2 
recruitment hub and becoming the new organizing hub. d, Assembled 
dynamin and other late-stage interaction partners (like uncoating factors) 
begin to function. 
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Connectivity interactomes are a different presentation of pathway 
progression, emphasizing the dynamic nature of the CME-network 
(Fig. 3, connectivity view). With each step of CME there is a corres- 
ponding change in protein connectivity. The increasing number of 
purple lines from Fig. 3a to 3b (connectivity view) illustrates the 
increasing number of interactions AP2 can achieve by being clustered 
on the membrane. One AP2 molecule can at most interact with four 
accessory proteins at once, one clathrin molecule, and a limited 
number of cargoes and lipid head groups. By assembling many 
AP2s at the membrane these interactions expand and lead to the 
fully engaged interactome. We would suggest that AP2 only 
functions as a hub when it is concentrated, forming an AP2 hub 
assembly zone’®. 

Many of the accessory proteins successfully recruited by AP2 to the 
site of endocytosis also have interactions with each other and with 
lipids (yellow lines), further stabilizing this intermediate and allow- 
ing the recruitment of proteins that are present in lower concentra- 
tions and that bind with lower affinities. In addition, many accessory 
proteins have clathrin interaction sites and therefore they can par- 
ticipate (with AP2) in clathrin recruitment and concentration, lead- 
ing to polymerization. Because there is steric overlap between most 
clathrin and AP2 sites in accessory proteins, AP2 loses its significance 
and there is a rewiring of the network because the newly formed 
clathrin hub takes over from the AP2 hub as the organizing centre 
(red lines to polymerized clathrin; Fig. 3c, connectivity view). Finally 
the conditions are ripe for the assembled scission molecule dynamin 
and subsequently the uncoating molecule auxilin to function (Fig. 
3d). More information on predicting the timescale of events from 
topological properties of interactomes can be found in Supplemen- 
tary Information Part 1 and Supplementary Fig. 2. 


What we learn from integration 


Plotting the CME pathway protein interactome allows one to imme- 
diately identify the key players (hubs) and approximately where they 
work in the pathway. 

Knowing what status a protein has in the network (hub versus a 
regular node) allows us to predict experimental outcomes. It has 
previously been suggested that depletion of proteins with many inter- 
action partners (by RNA interference) is more likely to give pheno- 
types**. Complete depletion of a pathway hub will probably eliminate 
the pathway. Partial depletion of the AP2 hub affects AP2-specific 
cargo internalization”’. Additional modules (alternative cargo adap- 
tors) lead to robustness in the network allowing alternative cargoes to 
be taken up with less or no dependence on AP2. Depletion of regular 
nodes is likely to be less deleterious than depletion of hubs. Depletion 
of a protein positioned in-between pathways (like dynamin, see 
below) may well inhibit many different pathways*””’. 

We would suggest that a good test for proteins that function as 
hubs only when clustered in space and time is that overexpression of 
such a protein should not have any phenotype on the pathway. 
Conversely, overexpression of nodes that bind directly to hubs would 
be predicted to have disastrous affects owing to titration of hub 
interaction points (see Supplementary Information Part 2, for fur- 
ther discussion). 


Fine-tuning hub nomenclature 


We describe the pathway hubs, AP2 and clathrin, as the control 
centres of the CME pathway. We do not ascribe hub status to dyna- 
min in the CME interactome because it has only four links into this 
pathway. However on the Human Protein Reference Database 
(http://www.hprd.org) dynamin is reported to have approximately 
50 interaction partners. Thus, in a proteome network, dynamin 
would also be a hub and would seem to connect many different 
pathways together. This is misleading, because from a functional 
point of view dynamin is simply a module (‘plug-in’) that is attached 
to many different pathways in a cell because of its function as a 
scission molecule, and it is not the connection point between these 
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pathways. According to the network nomenclature”, dynamin 
would be called a date hub, but in our view these hubs do not neces- 
sarily organize the proteome. On this point we would urge caution 
because the network nomenclature has preceded a functional under- 
standing of hubs. In a network, the node for dynamin should be 
present in as many copies as there are pathways in which it is 
involved, and each of these nodes should not be linked to each other. 

We would suggest that hubs could alternatively be named accord- 
ing to where they are positioned in a pathway at a given time, rather 
than the number and type of interactions. The notion that deletion of 
party/pathway hubs has much more limited effects than deletion of 
date hubs” can now be explained, because deletion of date-hubs will 
lead to more severe effects on cell viability owing to the additive result 
of disrupting a frequently used module (for example, dynamin**”’) 
and thus affecting disparate pathways, whereas deletion of a pathway 
hub (for example, AP2; refs 29, 33) will have more limited conse- 
quences (see also ref. 34 and Supplementary Information Part 2). 

Hubs in proteomes are generally not directly connected to other 
hubs™ and this may mean that the organization of each pathway is 
independent, with the loss of one hub not affecting other pathways in 
the cell’’. A more recent observation was that party hubs, being the 
core of highly clustered functional modules, are frequently directly 
connected”’. This fits with the direct connection between AP2 and 
clathrin in the CME interactome. A useful functional feature of this 
direct connection is seen in the biochemical observation that the 
clathrin hub displaces the AP2 hub interaction partners and thus 
ensures directionality to the dynamic protein interactions occurring 
during CCV formation’®. 

One important property of the pathway hubs AP2 and clathrin is 
that neither are hubs at the beginning of CME, but mature into hubs 
by clustering either on the membrane or through polymerization. It 
is likely that many pathway/party hub proteins will oligomerize or 
cluster to function as pathway hubs. ‘Clustered hubs’ are a new sub- 
type of hubs not previously described (Supplementary Information 
Part 3 and Supplementary Fig. 3). 


Consequences of hub-centric pathways 


The hub-centric organization of a pathway has at least four major 
consequences, described below. 

(1) Ease of module attachment. An appealing feature of a network is 
the ease with which modules of functionality can be attached/evolve. 
Hubs act as the control points where these modules can be plugged 
into the system to give extra capabilities. In the CME interactome, we 
find additional alternative cargo adaptors/clathrin-associated sorting 
proteins (CLASPs) (see Supplementary Tables) added to the network 
by binding to the AP2 and clathrin hubs, when specific cargo needs to 
be endocytosed®'**°. Thus by adding a small module, the repertoire 
and potentially independent regulation of this specialist cargo*® is 
accommodated while still using the same overall network. Not all 
pathways will have such complicated networks. For example, in 
COPI and COPII vesicle budding the network of interactions is much 
simpler**. Simpler and less flexible networks should have hubs with 
fewer connecting partners and not all hubs can be expected to be 
regulated by oligomerization. Indeed post-translational modifications 
such as phosphorylation could well be regulators. Phosphorylation 
controls the overall flux through CME at the synapse by influencing 
the ability to form protein networks and so phosphorylation in this 
context is a master regulator of CME, but it is also clear that phos- 
phorylation can fine-tune individual steps’”*””*. 

Modularity can also be seen from looking at the conservation of 
the CME network across species. We observe that most participating 
proteins are conserved in the animal kingdom (Table 1 and Supple- 
mentary Information Part 4). Species evolutionary distant from 
mammals, like the malaria parasite Plasmodium falciparum, tend to 
only conserve the hub proteins and not other nodes, an observation 
previously made more generally from proteome analysis”. 
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(2) Dynamic instability. In a network of interacting proteins most 
interactions are of necessity low affinity, and from biochemical mea- 
surements of affinities we know that many individual steps are 
dependent on coincidence of multiple interactions'®’’. Even the initial 
event of adaptor protein complexes binding to cargo is dependent on 
the presentation of binding sequences in membrane cargo and bind- 
ing of the adaptor protein complexes to a specific lipid. Coincidence is 
seen at all stages in the process because affinities are frequently low 
enough that multiple interactions are required to give an avidity that 
can allow an interaction to occur. This gives rise to a dynamic instab- 
ility in the network, because the network assembly will depend on 
whether sufficient adaptor protein complexes are recruited and suf- 
ficient accessory proteins are present to crosslink and organize these 
molecules and, subsequently, whether sufficient clathrin is recruited 
to stabilize the forming coated pit. 

(3) Pathway progression. Dynamic instability leads to the question 
of how directionality can be imposed on the overall pathway of CCV 
formation. At some point there must be commitment to coated pit 
assembly and the pathway needs directionality to reach the end prod- 
uct of a free vesicle. 

In CME, we are concerned with a network of interactions that can 
only form when the first hub protein AP2 is recruited and concen- 
trated on the membrane at sites for endocytosis. The strengths of 
network interactions around clustered AP2s are described as avid- 
ities, as they are the combination of many different interactions. To 
progress in vesicle formation, these avidities have to be overcome. 
Normally, this would be overcome with a still stronger interaction, 
or by weakening the previous interaction (perhaps via post- 
translational modification). In CME, clustered adaptor protein 
complexes, cross-linked by high avidity interactions with accessory 
proteins, need to progress to a mature CCV. In immunofluorescence 
experiments, it is observed that clathrin is recruited and polymerized 
around nascent vesicles while accessory proteins (for example, eps15; 
ref. 6) are lost. This is reinforced by the absence of most accessory 
proteins from purified CCVs*°. Biochemical experiments on the 
other hand tell us that clathrin competes poorly with accessory pro- 
teins for AP2 (refs 6, 8, 12) and so it would seem that clathrin is 
unlikely to be responsible for this loss of accessory proteins. The 
answer may lie in the different polymerization states of clathrin: in 
the biochemical competition experiments, clathrin is not polymer- 
ized but on the vesicle, clathrin is polymerized (Fig. 3b, c). Poly- 
merized clathrin will have different properties and needs to be viewed 
as a semi-solid matrix. Neither affinity nor avidity interactions 
describe the interactions of this matrix with its binding partners 
sufficiently, and therefore we have previously described this mass- 
action as ‘matricity’. Matricity describes a situation in which the 
matrix does not have traditional diffusion kinetics, and thus a very 
low-affinity interaction appropriately positioned may well be highly 
significant. Thus, the network moves from affinity-based interaction, 
to avidity-based interaction, to matricity-based interaction. This 
gives directionality to the pathway (Supplementary Fig. 4). 

One problem posed by this increasing stability of the CME inter- 
actome is that eventually components have to be recycled and reused 
for the next CCV formation. It is at this point that ATP hydrolysis by 
the uncoating proteins gives the required injection of energy to re- 
prime the molecules so that they can start the cycle again, when 
required. Of course, there are many other required inputs of energy 
into the system, such as in making the appropriate lipids, but it 
is interesting that this major input of energy occurs after vesicle- 
formation is committed (dynamin GTP hydrolysis) or completed 
(ATP hydrolysis by HSC70 during uncoating) (Fig. 3d). In exocyto- 
sis, we also see a network of protein interactions in which ATP 
hydrolysis is mostly seen after vesicle fusion has occurred, and this 
is used to reprime the molecules for the next round of fusion. This use 
of ATP late in the cycle also adds to the dynamic instability of the 
pathway. 
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(4) Fidelity and flexibility. In CME, there is an interesting obser- 
vation that hubs, AP2 and clathrin, bind to overlapping partners, and 
both hubs compete with each other for these partners. Evolution has 
neither resulted in these hubs being dispensed with, nor combined 
them into one, because the AP2 hub also has a specialized function of 
specific cargo interaction and clathrin has the specific cage-forming 
function. If these hubs were to become one, then the pathway would 
lose the dynamic instability, and hence the directionality and flexi- 
bility. We argue that AP2 binding modules must surely result in 
the expansion of cargo recruitment possibilities and if cells had no 
adaptors, but clathrin were to include a cargo binding domain, then 
this would reduce the flexibility of the cargo repertoire. Also, if 
clathrin were to polymerize on cargo binding then the process is 
more likely to cascade in a forward direction because clathrin self- 
polymerizes, forming empty cages. Natural selection has, instead, 
resulted in non-self-polymerizing modules to capture cargo, and a 
non-membrane-binding module to form the polymerized coat 
around this module. Thus the pathway has in-built fidelity. 


Conclusion 


Networks and interactome pathway maps are useful tools to help 
explain a biological pathway, but they need to be anchored in bio- 
logical contexts and experimental fact. This can initially be done by 
embedding interactomes within well-characterized pathways and 
using this information to inform us in other pathway investigations. 
We have made many observations about the nature of the CME 
interactome—about what constitutes a hub, the modularity of the 
system, the possible evolution of the network, and about dynamic 
instability, directionality and robustness within networks. We need 
to understand more about how proteins function together and are 
organized into pathways, and how these pathways are integrated into 
yet larger contexts. For this, a fresh perspective and a vision for 
conceptual thinking, coupled with technical advances to map out 
dynamic interactomes*' and in visual immunoprecipitation”’, is 
needed, which ultimately will lead to a better understanding of com- 
plex biological processes. 
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MAGNETOELECTRICS 


Is CdCr,S, a multiferroic relaxor? 


Arising from: J. Hemberger et al. Nature 434, 364-367 (2005) 


Materials showing simultaneous ferroelectric 
and magnetic ordering are attracting a great 
deal of interest because of their unusual phys- 
ics and potential applications’. Hemberger et 
al.” have reported relaxor-like dielectric prop- 
erties and colossal magnetocapacitance (in 
excess of 500%) for the cubic spinel compound 
CdCr,S, and related isomorphs*’, conclud- 
ing that CdCr,S, is a multiferroic relaxor. We 
argue here, however, that their results might 
also be explained by a conductive artefact. 

The polarization hysteresis loops of Hem- 
berger et al.” have a shape unlike that seen 
in ferroelectrics and more like that of a 
lossy dielectric®’, such as the semiconduct- 
ing thiospinels. The ‘remanent’ polarization 
is much larger than the pyroelectric one, 
whereas switchable polarization must always 
be smaller than the pyroelectric polarization 
— which on its own calls the hysteresis results 
into question. The pyroelectric polarizations 
of 0.05-0.07 uC cm~ are themselves tiny for 
any proper ferroelectric, and are consistent 
with thermally stimulated discharge of space- 
charge injected during poling’. 

Ab initio calculations*® preclude soft pho- 
nons, and hence proper ferroelectricity. Nor 
have Raman experiments’ found soft phonons 
or a lack of centrosymmetry. Raman results 
with wavelengths near or below the resonant 
edge’ are consistent with local non-centro- 
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Figure 1| Temperature of dielectric maxima as a 
function of alternating-current frequency. Data 
were extracted from refs 2—4 and fitted with the 
Vogel-Fulcher law of relaxors. The good fit (blue 
line) gives a negative value, whereas the T;, (the 
finite temperature at which the relaxor dynamics 
‘freezes into a polarizable state) suggested by 
Hemberger et al. (red line) does not reproduce 
the data. If T; is not positive, there can be no 
remanent polarization. If an Arrhenius law 

is fitted to the dielectric peaks, the activation 
energy (about 0.33 eV) (refs 3-5) is similar 

to that obtained from fitting the conductivity 
data (0.2-0.3 eV), indicating that the dielectric 
relaxation is indeed due to the temperature 
dependence of the conductivity. 
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symmetry (a necessary, although insufficient, 
condition for ferroelectricity) — however, this 
technique is unreliable because nominally for- 
bidden odd-parity longitudinal-optical pho- 
non scattering can be strong even in centric 
crystals’ when there is real absorption. 

Broad dielectric peaks occurring as a func- 
tion of temperature, and shifting to higher 
temperatures as the frequency of the alternat- 
ing current (a.c.) increases, are interpreted by 
Hemberger et al. as evidence that CdCr,§, is 
a relaxor ferroelectric’. In a relaxor, the tem- 
perature of the dielectric peak (T,,,) depends 
on the a.c. frequency, f, in accordance with the 
Vogel—Fulcher law: f=f,e“**\"™". Here, T; 
is the finite temperature at which the relaxor 
dynamics ‘freeze’ into a polarizable state. We 
have fitted the data for CdCr,S, (Fig. 1) and 
obtain an unrealistic value for T; of about 
-175 K. For polarization to be remanent at 
the temperatures suggested by Hemberger et 
al., T,-ought to be in the region of 60 K, which 
does not fit the data (Fig. 1). 

Relaxor-like behaviour can actually appear 
in any heterogeneous semiconductor as a result 
of the Maxwell-Wagner effect, a conductive 
artefact unrelated to relaxors’””’. The low-fre- 
quency divergence of the imaginary permittiv- 
ity’ is indeed consistent with direct-current 
(d.c.) conductivity in a Maxwell-Wagner sys- 
tem'’. Furthermore, any Maxwell-Wagner 
system (any heterogeneous semiconductor) 
that is magnetoresistive can show magnetoca- 
pacitance™ (Fig. 2), and the thiospinels are all 
magnetoresistive* >. 

Other indicators of magnetocapacitance 
due to magnetoresistive artefacts’ are the 
large effect of magnetic fields on dielectric 
loss, the strong frequency dependence of the 
magnetocapacitance and its correlation with 
magnetoresistance*. The magnetocapaci- 
tance in fact exceeds that possible for linear 
magnetoelectric coupling’ and, whereas 
larger magnetocapacitance is possible for 
nonlinear coupling, a different temperature 
and field dependence would be expected; in 
the thiospinels it is neither proportional to the 
magnetization M or M’, nor correlated with 
the alleged onset of polarization. 

Hemberger et al. try to rule out contact 
artefacts by using different electrodes*”, 
but Maxwell-Wagner does not require con- 
tact-related depletion layers. The unusual 
dielectric features disappear on annealing’, 
indicating that there could be a heterogeneous 
distribution of defects that can be annealed 
out. The authors’ single crystals were grown 
with chlorine as a transport agent, so chlo- 
rine-based impurities are one possibility. The 
authors themselves acknowledge the possibil- 
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Figure 2 | Maxwell-Wagner equivalent circuit 
model of a heterogeneous lossy dielectric. The 
model shows approximated equations for the real 
part of the apparent dielectric constant (€’) at 
high and low frequencies of alternating current. 
At high frequencies, the two types of region act as 
two capacitors in series (C1 and C2). The effect 

of lowering the frequency is the same as that of 
decreasing the resistance (R) of one of the two 
components: it allows the current (arrows) to flow 
through, reducing the apparent thickness and 
increasing the measured capacitance (C). In the 
thiospinels, applying a magnetic field has the same 
effect on capacitance as reducing the frequency’, 
consistent with negative magnetoresistance in a 
Maxwell—Wagner system". 


ity of sulphur non-stoichiometry”, so that is 
another possible culprit. 

The greater message here is that magneto- 
capacitance is not a property of a material, 
but an impedance measurement of a circuit; 
it may reflect the intrinsic magnetodielec- 
tric properties of the crystal under study’, or 
extrinsic parasitic contributions. Artificial 
magnetoelectric effects are common and do 
not require that the material be multiferroic. 
Because these artefacts are more likely in semi- 
conducting samples, such as the spinels, and 
because all reported data are consistent with a 
magnetoresistive Maxwell-Wagner effect, this 
seems to be the most plausible explanation. At 
a minimum, our analysis shows that evidence 
for the spinels being true relaxor multiferroics 
is at present inconclusive. 

Gustau Catalan, James F. Scott 
Department of Earth Sciences, University 
of Cambridge, Downing Street, 
Cambridge CB2 3EQ, UK 

e-mail: gcatO5@esc.cam.ac.uk 
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Hemberger et al. reply 


Replying to: G. Catalan & J. F. Scott Nature 448, doi: 10.1038/nature06156 (2007) 


Catalan and Scott’ propose an alternative 
interpretation for our findings’ for CdCr,S, 
in terms of Maxwell-Wagner effects. They 
also quote related isomorphs, such as HgCr,S, 
(ref. 3), which has been discussed elsewhere”. 
As we have shown, Maxwell-Wagner relaxa- 
tions can indeed strongly affect the dielectric 
properties of transition-metal oxides®’ and 
semiconductors*. We do not find Catalan and 
Scott’s arguments about CdCr,S, convincing, 
however, and think that our experiments pro- 
vide ample evidence for multiferroic relaxor 
behaviour. 

Taking the arguments of Catalan and Scott 
point by point, we explain why we stand by our 
interpretation. First, lossy dielectrics reveal 
purely elliptical hysteresis loops. The observed 
hysteresis is typical for relaxors when satura- 
tion is not reached’. CdCr,S, is not a proper 
ferroelectric. Because of the history and time 
dependence of relaxors’, the thermo-rema- 
nent polarization cannot be compared with 
results from P(E) loops. We excluded space- 
charge effects by frequency-dependent P(E) 
measurements’. 

Second, coupled spin and polarization 
waves, not soft phonons, are the relevant col- 
lective excitations in multiferroics”"’. Recent 
Raman experiments” revealing local polar 
distortions cannot be explained by resonance 
effects: those data were determined with 
energy that is much larger than the red-shifted 
temperature-dependent interband excitation. 
Consequently, the effect of local distortions on 
the longitudinal-optical intensity is larger than 
possible resonance contributions. Further- 
more, the different temperature dependencies 
of the longitudinal-optical mode intensities, 
the absent multiphonon and fluorescent con- 
tributions, and small phonon linewidth speak 
against strong resonance contributions. 

In their Fig. 1, Catalan and Scott analyse the 
peak positions of the temperature-dependent 


dielectric constant, e’, reported by us”. Such 
data do not provide precise information on 
relaxation dynamics and are hampered by 
conductivity contributions in semiconduct- 
ing CdCr,S,. Figure 1 presents more accurate 
information on the relaxation rate’. At high 
temperatures, a reasonable description with 
T; ~ 60 K is possible. An analysis based on 
e’(T), as performed by Catalan and Scott ‘and 
including the transition regime at low temper- 
atures, leads to an unrealistic value for T;. 

The low-frequency divergence of the loss is 
a common feature of materials with significant 
conductivity. The circuit discussed by Catalan 
and Scott’ cannot account for the complex 
behaviour of CdCr,S, and more elaborate net- 
works®’ would be needed. Magnetoresistance 
is a necessary but insufficient condition for 
Maxwell-Wagner-related magnetocapacitance. 
The field changes the relaxation dynamics””’, 
which explains the effect on the relaxation- 
related loss and the frequency-dependent 
magnetocapacitance. The correlation between 
magnetocapacitance and magnetoresistance 
arises because both are triggered by the strong 
field-induced shift of the ferromagnetic tran- 
sition temperature. Concerning the magni- 
tude of the magnetocapacitance, CdCr,S, is 
not a conventional multiferroic and cannot 
be treated with the same theoretical footing’. 
The dynamic nature of the effect”’’ explains 
the lack of correlation between polarization 
and magnetocapacitance. 

From electron probe microanalysis and 
the narrow linewidths in X-ray studies using 
synchrotron radiation, we exclude any non- 
homogeneous impurity distribution in our 
single crystals. Meanwhile, we have repro- 
duced all effects in annealed single crystals 
grown without chlorine and in ceramics 
doped with indium. 

We therefore think that our experiments 
provide evidence for intrinsic multiferroic 
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Figure 1| Temperature dependence of the 
relaxation rate of CdCr,S,. The relaxation rate, f, 
was calculated from the relaxation times T, shown 
in ref. 13, determined from fits of the frequency- 
dependent complex permittivity. These data are 
more accurate than those analysed in Fig. 1 of ref. 
1. As revealed by the Arrhenius representation 

of Fig. 1, at high temperatures the data follow 
thermally activated behaviour”’, but they also can 
be fitted by a Vogel-Fulcher law, with T; = 60 K 
(line). At low temperatures, strong ferromagnetic 
fluctuations’ lead to deviations owing to an 


acceleration of the relaxation dynamics””. 


behaviour, supporting early assumptions of 
polar phase transitions and magnetoelectric 
effects in spinels”. 

Joachim Hemberger*, Peter Lunkenheimer*, 
Robert Fichtl*, Hans-Albrecht Krug von 
Nidda*, Vladimir Tsurkan*+, Alois Loidl* 
*Experimental Physics V, Centre for Electronic 
Correlations and Magnetism, University of 
Augsburg, 86159 Augsburg, Germany 

e-mail: peter.lunkenheimer@physik.uni- 
augsburg.de 

yInstitute for Applied Physics, Academy of 
Sciences of Moldova, MD-2028, Chisinau, 
Moldova 


1. Catalan, G. & Scott, J. F. Nature 448, doi:10.1038/ 
nature06156 (2007). 
2. Hemberger, J. et al. Relaxor ferroelectricity and colossal 


ES 


BRIEF COMMUNICATIONS ARISING 


magnetocapacitive coupling in ferromagnetic CdCr,S,. 6. 


Nature 434, 364-367 (2005). 
3. Weber, S. et al. Colossal magnetocapacitance and 
colossal magnetoresistance in HgCr,S,. Phys. Rev. Lett. 7. 
96, 157202 (2006). 
4. Catalan, G. & Scott, J. F. Comments on “Giant 
dielectric response in the one-dimensional charge- 
ordered semiconductor (NbSe,)3I" and “Colossal 


magnetocapacitance and colossal magnetoresistance 8. 


in HgCr,S,. Preprint at arXiv:cond-mat/0607500 
(2006). 
5. Lunkenheimer, P. et al. Reply to comments on “Giant 


dielectric response in the one-dimensional charge- 9. 


ordered semiconductor (NbSe,),|" and “Colossal 
magnetocapacitance and colossal magnetoresistance in 
HgCr,S,". Preprint at arXiv:cond-mat/0701417 (2007). 


E6 


Lunkenheimer, P., Fichtl, R., Ebbinghaus, S. G. & Loidl, A. 
Nonintrinsic origin of the colossal dielectric constants in 
CaCu,Ti,O,,. Phys. Rev. B70, 172102 (2004). 

Bobnar, V., Lunkenheimer, P., Paraskevopoulos, M. 

& Loidl, A. Separation of grain boundary effects and 
intrinsic properties in perovskite-like Gd, <Yp,BaCo,0.. 
using high-frequency dielectric spectroscopy. Phys. Rev. B 
65, 184403 (2002). 

Ritus, A. |. et al. Determination of the parameters of 
semiconducting CdF2:In with Schottky barriers from 
radio-frequency measurements. Phys. Rev. B65, 165209 
(2002). 

Samara, G. A. The relaxational properties of 
compositionally disordered ABO; perovskites. J. Phys.: 
Condens. Matter 15, R367-R411 (2003). 


10. Pimenoy, A. et al. Possible evidence for electromagnons in 


© 2007 Nature Publishing Group 


N 


w 


NATURE|Vol 448|23 August 2007 


multiferroic manganites. Nature Phys. 2, 97-100 (2006). 


. Katsura, H., Balatsky, A. V. & Nagaosa, N. Dynamical 


magnetoelectric coupling in helical magnets. Phys. Rev. 
Lett. 98, 027203 (2007). 


. Gnezdilov, V. et al. Evidence for local lattice distortions in 


giant magnetocapacitive CdCr.S,. Preprint at arXiv:cond- 
mat 0702362 (2007). 


. Lunkenheimer, P., Fichtl, R., Hemberger, J., Tsurkan, 


V. & Loidl, A. Relaxation dynamics and colossal 
magnetocapacitive effect in CdCr,S,. Phys. Rev. B 72, 
060103 (2005). 

Schmid, H. & Ascher, E. Are antiferroelectricity and other 
physical properties ‘hidden’ in spinel compounds? J. Phys. 
C: Solid State Phys. 7, 2697-2706 (1974). 


doi:10.1038/nature06157 


BRIEF COMMUNICATIONS ARISING 


MAGNETOELECTRICS 


Is CdCr,S, a multiferroic relaxor? 


Arising from: J. Hemberger et al. Nature 434, 364-367 (2005) 


Materials showing simultaneous ferroelectric 
and magnetic ordering are attracting a great 
deal of interest because of their unusual phys- 
ics and potential applications’. Hemberger et 
al.” have reported relaxor-like dielectric prop- 
erties and colossal magnetocapacitance (in 
excess of 500%) for the cubic spinel compound 
CdCr,S, and related isomorphs*’, conclud- 
ing that CdCr,S, is a multiferroic relaxor. We 
argue here, however, that their results might 
also be explained by a conductive artefact. 

The polarization hysteresis loops of Hem- 
berger et al.” have a shape unlike that seen 
in ferroelectrics and more like that of a 
lossy dielectric®’, such as the semiconduct- 
ing thiospinels. The ‘remanent’ polarization 
is much larger than the pyroelectric one, 
whereas switchable polarization must always 
be smaller than the pyroelectric polarization 
— which on its own calls the hysteresis results 
into question. The pyroelectric polarizations 
of 0.05-0.07 uC cm~ are themselves tiny for 
any proper ferroelectric, and are consistent 
with thermally stimulated discharge of space- 
charge injected during poling’. 

Ab initio calculations*® preclude soft pho- 
nons, and hence proper ferroelectricity. Nor 
have Raman experiments’ found soft phonons 
or a lack of centrosymmetry. Raman results 
with wavelengths near or below the resonant 
edge’ are consistent with local non-centro- 
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Figure 1| Temperature of dielectric maxima as a 
function of alternating-current frequency. Data 
were extracted from refs 2—4 and fitted with the 
Vogel-Fulcher law of relaxors. The good fit (blue 
line) gives a negative value, whereas the T;, (the 
finite temperature at which the relaxor dynamics 
‘freezes into a polarizable state) suggested by 
Hemberger et al. (red line) does not reproduce 
the data. If T; is not positive, there can be no 
remanent polarization. If an Arrhenius law 

is fitted to the dielectric peaks, the activation 
energy (about 0.33 eV) (refs 3-5) is similar 

to that obtained from fitting the conductivity 
data (0.2-0.3 eV), indicating that the dielectric 
relaxation is indeed due to the temperature 
dependence of the conductivity. 


E4 


symmetry (a necessary, although insufficient, 
condition for ferroelectricity) — however, this 
technique is unreliable because nominally for- 
bidden odd-parity longitudinal-optical pho- 
non scattering can be strong even in centric 
crystals’ when there is real absorption. 

Broad dielectric peaks occurring as a func- 
tion of temperature, and shifting to higher 
temperatures as the frequency of the alternat- 
ing current (a.c.) increases, are interpreted by 
Hemberger et al. as evidence that CdCr,§, is 
a relaxor ferroelectric’. In a relaxor, the tem- 
perature of the dielectric peak (T,,,) depends 
on the a.c. frequency, f, in accordance with the 
Vogel—Fulcher law: f=f,e“**\"™". Here, T; 
is the finite temperature at which the relaxor 
dynamics ‘freeze’ into a polarizable state. We 
have fitted the data for CdCr,S, (Fig. 1) and 
obtain an unrealistic value for T; of about 
-175 K. For polarization to be remanent at 
the temperatures suggested by Hemberger et 
al., T,-ought to be in the region of 60 K, which 
does not fit the data (Fig. 1). 

Relaxor-like behaviour can actually appear 
in any heterogeneous semiconductor as a result 
of the Maxwell-Wagner effect, a conductive 
artefact unrelated to relaxors’””’. The low-fre- 
quency divergence of the imaginary permittiv- 
ity’ is indeed consistent with direct-current 
(d.c.) conductivity in a Maxwell-Wagner sys- 
tem'’. Furthermore, any Maxwell-Wagner 
system (any heterogeneous semiconductor) 
that is magnetoresistive can show magnetoca- 
pacitance™ (Fig. 2), and the thiospinels are all 
magnetoresistive* >. 

Other indicators of magnetocapacitance 
due to magnetoresistive artefacts’ are the 
large effect of magnetic fields on dielectric 
loss, the strong frequency dependence of the 
magnetocapacitance and its correlation with 
magnetoresistance*. The magnetocapaci- 
tance in fact exceeds that possible for linear 
magnetoelectric coupling’ and, whereas 
larger magnetocapacitance is possible for 
nonlinear coupling, a different temperature 
and field dependence would be expected; in 
the thiospinels it is neither proportional to the 
magnetization M or M’, nor correlated with 
the alleged onset of polarization. 

Hemberger et al. try to rule out contact 
artefacts by using different electrodes*”, 
but Maxwell-Wagner does not require con- 
tact-related depletion layers. The unusual 
dielectric features disappear on annealing’, 
indicating that there could be a heterogeneous 
distribution of defects that can be annealed 
out. The authors’ single crystals were grown 
with chlorine as a transport agent, so chlo- 
rine-based impurities are one possibility. The 
authors themselves acknowledge the possibil- 
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Figure 2 | Maxwell-Wagner equivalent circuit 
model of a heterogeneous lossy dielectric. The 
model shows approximated equations for the real 
part of the apparent dielectric constant (€’) at 
high and low frequencies of alternating current. 
At high frequencies, the two types of region act as 
two capacitors in series (C1 and C2). The effect 

of lowering the frequency is the same as that of 
decreasing the resistance (R) of one of the two 
components: it allows the current (arrows) to flow 
through, reducing the apparent thickness and 
increasing the measured capacitance (C). In the 
thiospinels, applying a magnetic field has the same 
effect on capacitance as reducing the frequency’, 
consistent with negative magnetoresistance in a 
Maxwell—Wagner system". 


ity of sulphur non-stoichiometry”, so that is 
another possible culprit. 

The greater message here is that magneto- 
capacitance is not a property of a material, 
but an impedance measurement of a circuit; 
it may reflect the intrinsic magnetodielec- 
tric properties of the crystal under study’, or 
extrinsic parasitic contributions. Artificial 
magnetoelectric effects are common and do 
not require that the material be multiferroic. 
Because these artefacts are more likely in semi- 
conducting samples, such as the spinels, and 
because all reported data are consistent with a 
magnetoresistive Maxwell-Wagner effect, this 
seems to be the most plausible explanation. At 
a minimum, our analysis shows that evidence 
for the spinels being true relaxor multiferroics 
is at present inconclusive. 

Gustau Catalan, James F. Scott 
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Hemberger et al. reply 


Replying to: G. Catalan & J. F. Scott Nature 448, doi: 10.1038/nature06156 (2007) 


Catalan and Scott’ propose an alternative 
interpretation for our findings’ for CdCr,S, 
in terms of Maxwell-Wagner effects. They 
also quote related isomorphs, such as HgCr,S, 
(ref. 3), which has been discussed elsewhere”. 
As we have shown, Maxwell-Wagner relaxa- 
tions can indeed strongly affect the dielectric 
properties of transition-metal oxides®’ and 
semiconductors*. We do not find Catalan and 
Scott’s arguments about CdCr,S, convincing, 
however, and think that our experiments pro- 
vide ample evidence for multiferroic relaxor 
behaviour. 

Taking the arguments of Catalan and Scott 
point by point, we explain why we stand by our 
interpretation. First, lossy dielectrics reveal 
purely elliptical hysteresis loops. The observed 
hysteresis is typical for relaxors when satura- 
tion is not reached’. CdCr,S, is not a proper 
ferroelectric. Because of the history and time 
dependence of relaxors’, the thermo-rema- 
nent polarization cannot be compared with 
results from P(E) loops. We excluded space- 
charge effects by frequency-dependent P(E) 
measurements’. 

Second, coupled spin and polarization 
waves, not soft phonons, are the relevant col- 
lective excitations in multiferroics”"’. Recent 
Raman experiments” revealing local polar 
distortions cannot be explained by resonance 
effects: those data were determined with 
energy that is much larger than the red-shifted 
temperature-dependent interband excitation. 
Consequently, the effect of local distortions on 
the longitudinal-optical intensity is larger than 
possible resonance contributions. Further- 
more, the different temperature dependencies 
of the longitudinal-optical mode intensities, 
the absent multiphonon and fluorescent con- 
tributions, and small phonon linewidth speak 
against strong resonance contributions. 

In their Fig. 1, Catalan and Scott analyse the 
peak positions of the temperature-dependent 


dielectric constant, e’, reported by us”. Such 
data do not provide precise information on 
relaxation dynamics and are hampered by 
conductivity contributions in semiconduct- 
ing CdCr,S,. Figure 1 presents more accurate 
information on the relaxation rate’. At high 
temperatures, a reasonable description with 
T; ~ 60 K is possible. An analysis based on 
e’(T), as performed by Catalan and Scott ‘and 
including the transition regime at low temper- 
atures, leads to an unrealistic value for T;. 

The low-frequency divergence of the loss is 
a common feature of materials with significant 
conductivity. The circuit discussed by Catalan 
and Scott’ cannot account for the complex 
behaviour of CdCr,S, and more elaborate net- 
works®’ would be needed. Magnetoresistance 
is a necessary but insufficient condition for 
Maxwell-Wagner-related magnetocapacitance. 
The field changes the relaxation dynamics””’, 
which explains the effect on the relaxation- 
related loss and the frequency-dependent 
magnetocapacitance. The correlation between 
magnetocapacitance and magnetoresistance 
arises because both are triggered by the strong 
field-induced shift of the ferromagnetic tran- 
sition temperature. Concerning the magni- 
tude of the magnetocapacitance, CdCr,S, is 
not a conventional multiferroic and cannot 
be treated with the same theoretical footing’. 
The dynamic nature of the effect”’’ explains 
the lack of correlation between polarization 
and magnetocapacitance. 

From electron probe microanalysis and 
the narrow linewidths in X-ray studies using 
synchrotron radiation, we exclude any non- 
homogeneous impurity distribution in our 
single crystals. Meanwhile, we have repro- 
duced all effects in annealed single crystals 
grown without chlorine and in ceramics 
doped with indium. 

We therefore think that our experiments 
provide evidence for intrinsic multiferroic 
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Figure 1| Temperature dependence of the 
relaxation rate of CdCr,S,. The relaxation rate, f, 
was calculated from the relaxation times T, shown 
in ref. 13, determined from fits of the frequency- 
dependent complex permittivity. These data are 
more accurate than those analysed in Fig. 1 of ref. 
1. As revealed by the Arrhenius representation 

of Fig. 1, at high temperatures the data follow 
thermally activated behaviour”’, but they also can 
be fitted by a Vogel-Fulcher law, with T; = 60 K 
(line). At low temperatures, strong ferromagnetic 
fluctuations’ lead to deviations owing to an 


acceleration of the relaxation dynamics””. 


behaviour, supporting early assumptions of 
polar phase transitions and magnetoelectric 
effects in spinels”. 
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Progressive field-state collapse and 
quantum non-demolition photon counting 


Christine Guerlin’, Julien Bernu', Samuel Deléglise’, Clement Sayrin’, Sébastien Gleyzes’, Stefan Kuhr't, 
Michel Brune’, Jean-Michel Raimond! & Serge Haroche’” 


The irreversible evolution of a microscopic system under measurement is a central feature of quantum theory. From an 
initial state generally exhibiting quantum uncertainty in the measured observable, the system is projected into a state in 
which this observable becomes precisely known. Its value is random, with a probability determined by the initial system's 
state. The evolution induced by measurement (known as ‘state collapse’) can be progressive, accumulating the effects of 
elementary state changes. Here we report the observation of such a step-by-step collapse by non-destructively measuring 
the photon number of a field stored in a cavity. Atoms behaving as microscopic clocks cross the cavity successively. By 
measuring the light-induced alterations of the clock rate, information is progressively extracted, until the initially uncertain 
photon number converges to an integer. The suppression of the photon number spread is demonstrated by correlations 
between repeated measurements. The procedure illustrates all the postulates of quantum measurement (state collapse, 


statistical results and repeatability) and should facilitate studies of non-classical fields trapped in cavities. 


The projection of a microscopic system into an eigenstate of the 
measured observable reflects the change of knowledge produced by 
the measurement. Information is either acquired in a single step, as in 
a Stern—Gerlach spin-component measurement’, or in an incremen- 
tal way, as in spin-squeezing experiments”’. A projective measure- 
ment is called ‘quantum non-demolition** (QND) when the 
collapsed state is invariant under the system’s free unitary evolution. 
Sequences of repeated measurements then yield identical results and 
jumps between different outcomes reveal an external perturbation”*. 

Various QND measurements have been realized on massive part- 
icles. The motional state of a trapped electron has been measured 
through the current induced in the trapping electrodes’. The internal 
state of trapped ions has been read out, directly by way of laser- 
induced fluorescence”, or indirectly through quantum gate opera- 
tions entangling them to an ancillary ion'’. Collective spin states of an 
atomic ensemble have been QND-detected through its dispersive 
interaction with light’. 

QND light measurements are especially challenging, as photons 
are detected with photosensitive materials that usually absorb them. 
Photon demolition is however avoidable’’. In non-resonant pro- 
cesses, light induces nonlinear dispersive effects'* in a medium, with- 
out real transitions. Photons can then be detected without loss. 
Dispersive schemes have been applied to detect the fluctuations of 
a signal light beam by the phase shifts it induces on a probe beam 
interacting with the same medium’®’’. Neither these methods, nor 
alternative ones based on the noiseless duplication of light by optical 
parametric amplifiers'”'*, have been able, so far, to pin down photon 
numbers. 

Single-photon resolution requires an extremely strong light— 
matter coupling, optimally achieved by confining radiation inside a 
cavity. This is the domain of cavity quantum electrodynamics'’?”', in 
which experiments attaining single-quantum resolution have been 
performed with optical’*”* or microwave photons, the latter being 
coupled either to Rydberg atoms**”° or to superconducting junc- 
tions’’. In a QND experiment, cavity losses should be negligible 


during a sequence of repeated measurements. We have realized a 
superconducting cavity with a very long field damping time’, and 
used it to detect repeatedly a single photon’. Here, we demonstrate 
with this cavity a general QND photon counting method applied to a 
microwave field containing several photons. It implements a variant 
of a procedure proposed in refs 30 and 31, and illustrates all the 
postulates of a projective measurement’. 

A stream of atoms crosses the cavity and performs a step-by-step 
measurement of the photon number-dependent alteration of the 
atomic transition frequency known as the ‘light shift’. We follow 
the measurement-induced evolution from a coherent state of light 
into a Fock state of well-defined energy, containing up to 7 photons. 
Repeating the measurement on the collapsed state yields the same 
result, until cavity damping makes the photon number decrease. The 
measured field energy then decays by quantum jumps along a stair- 
case-like cascade, ending in vacuum. 

In this experiment, light is an object of investigation repeatedly 
interrogated by atoms. Its evolution under continuous non- 
destructive monitoring is directly accessible to measurement, making 
real the stochastic trajectories of quantum field Monte Carlo simula- 
tions””**, Repeatedly counting photons in a cavity as marbles in a 
box opens novel perspectives for studying non-classical states of 
radiation. 


An atomic clock to count photons 


To explain our QND method, consider the thought experiment 
sketched in Fig. la. A photon box, similar to the contraption ima- 
gined in another context by Einstein and Bohr', contains a few 
photons together with a clock whose rate is affected by the light. 
Depending upon the photon number 1, the hand of the clock points 
in different directions after a given interaction time with the field. 
This time is set so that a photon causes a t/q angular shift of the hand 
(here q is an integer). There are 2q values (0, 1, ...2q—1) of the 
photon number corresponding to regularly spaced directions of the 
hand, spanning 360° (Fig. la shows the hand’s positions for q = 4and 


‘Laboratoire Kastler Brossel, Ecole Normale Supérieure, CNRS, Université Pierre et Marie Curie, 24 rue Lhomond, 75231 Paris Cedex 05, France. *Collége de France, 11 place Marcelin 
Berthelot, 75231 Paris Cedex 05, France. Present address: Johannes Gutenberg Universitat, Institut ftir Physik, Staudingerweg 7, 55128 Mainz, Germany. 
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n= 0, 1,3). For n= 2g, the readings periodically repeat and the clock 
measures n modulo 2q. 

This description is translatable into the atomic world (Fig. 1b). The 
evolution of a two-level atom crossing the cavity C is described as 
the rotation of a spin evolving on a Bloch sphere’’. The atomic levels, 
|+.), correspond to up and down spin states along the O—z direction. 
Before entering C, the spin | is rotated by a pulse R,; from |+,) into 
state |+,) = (|+.) + |—,))/2, represented by a vector along O-x. 
This vector then starts to rotate in the x-O-y equatorial plane, in 
analogy with the ticking of the hand in the thought experiment. The 
atomic flight time across C is adjusted to result, per photon, in a 1/q 
light-induced rotation of the spin. The 2q final spin states |+,,) cor- 
related with 0 = n< 2q correspond to vectors reproducing the posi- 
tions of the classical clock’s hand. 

In general, the photon number exhibits quantum uncertainty. The 
field, initially in a superposition 5>,c,|n) of Fock states |n), gets 
entangled with the spin, the final atom-field state becoming 
So nCnl + @|n). The spin points in a fuzzy direction that the QND 
measurement is desigaed. to pin down. As an example, consider a 
field in a coherent state™ of complex amplitude « defined by the 
C-numbers c,, = exp( (—|al7/ 2)(a"/\n!). Its photon number distri- 
bution, Poln) = lel? i is poisepnian with an average photon number 
no = ||? and a spread An= \M. 

If we could determine the final atom state, the clock’s delay—and 
hence n—would be read in a single measurement. This is however 
forbidden by quantum theory**. The 2q spin states are not mutually 
orthogonal (except for gq = 1, see below) and cannot be unambigu- 
ously distinguished (this ambiguity is exploited in quantum cryp- 
tography~’). Only partial information can be extracted from a spin, 
namely its projection along a direction O-u in the x-O-y plane 
making an arbitrary angle ¢ with O-x. 

The angle between O-u and the direction of the |+,,) spin state is 
nt/q—. The conditional Baas for detecting the |+,,) states 
when C contains n photons are P(j,@|n) = [1 + cos(nn/q — @ + jm) ]/2 
(using quantum information notation’’, we assign to the +/— spin 
states the values j = 0/1 and rename |j,#) the states |+ ,)). Measuring 
the spin along O—uis performed by submitting it, after cavity exit, toa 
pulse R, whose phase is set to map O—u onto O-z (Fig. 1b). This 
rotation is followed by the measurement of the atom’s energy, equi- 
valent to a spin detection along O—z. The combination of Ry and Risa 
Ramsey interferometer’. The probabilities for finding j=0 and 1 


O-u 


Figure 1| Principle of QND photon counting. a, Thought experiment with a 
clock in a box containing n photons. The hand of the clock undergoes a 11/4 
phase-advance per photon (n = 0, 1, 3 represented). b, Evolution of the 
atomic spin on the Bloch sphere in a real experiment: an initial pulse R, 
rotates the spin from O-z to O-x (left). Light shift produces a 1/4 phase shift 
per photon of the spin’s precession in the equatorial plane. Directions 
associated with n = 0 to 7 end up regularly distributed over 360° (centre). 
Pulse R2 maps the direction O-—u onto O-z, before the atomic state is read 
out (right). 
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along O-z oscillate versus ¢, which is a typical feature of quantum 
interference (Methods). 

The q= 1 case is a notable exception for which a single measure- 
ment yields complete information. There are then only two opposite 
hand positions on the atomic clock dial, corresponding to ortho- 
gonal states. Ideally, a single detection pins down n modulo 2, yield- 
ing the photon number parity. For weak fields with m <1, the 
probability for n> 1 is negligible and the parity defines n. The tele- 
graphic signals” obtained by detecting a stream of atoms reveal the 
photon number evolution, with quantum jumps between n = 0 and 1 
as the field randomly exchanges energy with the cavity walls. For 
larger fields, though, information must be extracted in a subtler way. 


Progressive pinning-down of photon number 


The random outcome of a spin detection modifies our knowledge of 
the photon number distribution. The conditional probability 
P(n\j,@) for finding n photons after detecting the spin value j along 
O-u is related to the inverse conditional probability P(j,@|n) b 
Bayes’ law®’: 


P(nljnp) 


where P(j,~) = >> ,,P(j,@|n) Po(n) is the a priori probability for j. This 
formula directly follows from the definition of conditional probabil- 
ities. It can also be derived from the projection postulate’. After 
detection of the spin in state |j,p), the entangled atom-field system 
collapses into [D> ,¢,.{j|+ .)|M]1@|p,oy/y 'P( (j,). This entails that the 
photon number probability is (up to a global factor) multiplied by 
Gil +0]? = PGln). 

Equation (1) embodies the logic of our QND procedure. The spin 
measurement has the effect of multiplying Po(m) by P(j,¢|1), which is 
a periodic function vanishing for specific values of n when ¢ is prop- 
erly adjusted. If we choose ¢ = mp/q (where p is an integer), O—-u 
points along the direction of |+,). This entails P(j=1,¢|p) = 
P(j=0, |p + q) = 0. Detecting the spin in 1 (resp. 0) excludes the 
photon number n= p (resp. p+q) as these outcomes are forbidden 
for the corresponding Fock states. One of the probabilities for finding 
n= p or n= p+q is cancelled, while the other is enhanced (when 
normalization is accounted for). At the same time, the probabilities 
of other photon numbers are modified according to equation (1). 

This decimation is robust against imperfections. In a realistic situ- 
ation, the theoretical probability P(j,¢|n) becomes P?)(j,@|n) = 
[A+ B cos(n® — @ + jm)]/2, where A and B are the Ramsey inter- 
ferometer fringes offset and contrast, somewhat different from 1. The 
phase shift per photon ® may also slightly depart from 1/q. Before a 
QND measurement, A, B and @ are determined by independent 
calibration. The limited contrast of the interferometer corresponds 
to a statistical uncertainty in the final atomic state. The atom and the 
field must then be described by density operators instead of pure 
states. The formula (1) remains valid with P(j,¢|n) replaced by 
P‘P)(j,|n). This is justified by Bayes’ law or by generalization of 
the measurement postulate to statistical mixtures”. 

In order to obtain more information, we repeat the process and send 
a sequence of atoms across C. This results in a step-by-step change of 
the photon number distribution. From one atom to the next, we vary @. 
Calling #(k) the detection angle for the Ath atom and j(k) its spin 
reading, the photon number distribution after N atoms becomes: 

N 

Pol) TH [A+ Beos(nd—9(k) +)(K)n)) (2) 
where Z enforces normalization. For an efficient decimation, we altern- 
ate between detection directions nearly coinciding with the vectors 
associated with q non-orthogonal |+,) states. Each atom has a chance 
to reduce the probability of a photon number different from the one 
decimated by its predecessor. After a finite number of steps, numerical 
simulations predict that a single n value (modulo 2q) survives. 


= Po(n)P(j,|n)/ PU) = Po(n)[1 + cos(nn/q — b + jr) ]/2PG,0) (1) 


Py(n) = 
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Observing the field-state collapse 

We have applied this procedure to a coherent microwave field at 
51.1GHz stored in an ultrahigh-Q Fabry—Pérot cavity made of 
niobium-coated superconducting mirrors”*. Our set-up is described 
in ref. 29. The cavity has a very long damping time T- = 0.130s. It is 
cooled to 0.8 K (average thermal photon number n, = 0.05). The field 
is prepared by coupling a short microwave pulse into C (by way of 
diffraction on the mirrors’ edges**). Its photon number distribution 
and average photon number, mo = 3.82 + 0.04, are inferred from the 
experimental data (see below). Our single-photon-sensitive spin- 
clocks are circular Rydberg atoms of rubidium. They cross C succes- 
sively, separated on average by 2.33 X 10 “s. Parameters are adjusted 
to realize a ~1/4 clock shift per photon (Methods), corresponding to 
eight positions of the spin on the Bloch sphere (Fig. 1b). This con- 
figuration is adapted to count photon numbers between 0 and 7. For 
Ny = 3.82, the probability for n= 8 is 3.5%. 

Four phases @; (i= a, b, c, d), corresponding to directions pointing 
approximately along the spin states associated with n = 6, 7, 0, 1, are 
used, in random order, for successive atoms (Methods). A sequence 
of j values can be decoded only when combined with the correspond- 
ing phase choices, in analogy with the detection basis reconciliation 
of quantum key distribution protocols**. Figure 2a shows the data 
from the first 50 detected atoms, presented as (7, i) doublets, for two 
independent detection sequences performed on the same initial field. 

From these real data, we compute the products of functions 
IT y(n) = T= 1...N) [A + Bcos(n® _ Pie + j(k)r)]. The A, B, @P 
and ¢; values are given by Ramsey interferometer calibration 
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(Methods). The evolutions of /7)(m), displayed as functions of n 
treated as a continuous variable, are shown in Fig. 2b for N increasing 
from 1 to 50. The /7)(m) functions converge into narrow distribu- 
tions whose widths decrease as more information is acquired. These 
functions are determined uniquely by the experimental data. Their 
evolution is independent of any a priori knowledge of the initial 
photon distribution. The data sequence itself; however, depends of 
the unknown state of the field, which the measurement reveals. 

Inserting /7\(n) into equation (2) and extending the procedure to 
N= 110, we obtain the evolution of the photon number histograms 
for these two realizations (Fig. 2c). These histograms show how our 
knowledge of the field state evolves in a single measuring sequence, as 
inferred from baysian logic. The initial distributions (Po() = 1/8) 
are flat because the only knowledge assumed at the beginning of each 
sequence is the maximum photon number pax. Data are analysed 
after the experiment, but P)(m) could also be obtained in real time. 
The progressive collapse of the field into a Fock state (here |n = 5) or 
|n = 7)) is clearly visible. Information extracted from the first 20 to 30 
atoms leaves an ambiguity between two competing Fock states. After 
~50 atoms (detected within ~0.012s), each distribution has turned 
into a main peak with a small satellite, which becomes totally neg- 
ligible at the end of the two sequences. 


Reconstructing photon number statistics 


Repeatedly preparing the field in the same coherent state, we have 
analysed 2,000 independent sequences, each made of 110 (j, i) doublets 
recorded within T,, ~ 0.026 s. This measuring time is a compromise. 


Figure 2 | Progressive collapse of 
field into photon number state. 
a, Sequences of (j, i) data (first 50 


gj 1101111111110011101101111 j 0001000110110000001010110 

i ddcbccabcdaadaabadddbadbc i ddcaddabbccdccbcdaabbccab atoms) produced by two 
independent measurements. 

j 0101001101010101101011111 j 0001010100000100011101101 b, Evolution of /7y(n) for the two 

i dababbaacbecdadccdcbaaace i bcdaddaabbbbdbdcdccadaada sequences displayed in a, when N 


increases from 1 to 50, n being 
treated as a continuous variable 
(integral of /7,(n) normalized to 
unity). c, Photon number 
probabilities plotted versus photon 
and atom numbers and N. The 
histograms evolve, as N increases 
from 0 to 110, from a flat 
distribution into n = 5 andn=7 
peaks. 
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Figure 3 | Reconstructed photon number distribution. Histogram of (n) 
values obtained from 2,000 QND collapse sequences (each involving N = 110 
atoms). The (m)s are sampled in intervals of 0.2. The error bars are the 
statistical standard deviations. The peaks at integer numbers reveal Fock 
states. The background is due to incomplete or interrupted collapses. Data 
shown as blue circles are obtained by fitting the distribution of integer number 
peaks to a Poisson law, yielding (n) aye = 3.46 + 0.04 (the blue line represents a 
continuous Poisson distribution joining the circles as a guide for the eye). 


Short compared to T,, it is long enough to allow for a good convergence 
of the photon number distribution. We have computed the mean 
photon number (1) = 5>,,nPy(n) at the end of each sequence. The 
histogram of these (n) values, sampled in bins of width 0.2, is shown 
in Fig. 3. Peaks at integers appear on top of a small background due to 
sequences that have not fully collapsed, or that have been interrupted 
by field decay. 

The histogram of the peaks in Fig. 3 directly reveals the photon 
number probability distribution of the initial field, modified by damp- 
ing during the measurement. Disregarding the 23% background, we fit 
the experimental histogram of integer values to a Poisson law with 
(Nave = 3.46 + 0.04 (blue circles), and normalized to 0.77 (probability 
of fully converged sequences). This is the expected distribution for a 
coherent field with an initial mean np = 3.82 + 0.04, after decay during 
the time T,,,/2 ~ T,/10. Remarkably, the non-converged sequences do 
not introduce any noticeable bias in the distribution of fully collapsed 
measurements. The experimental excess probability of 0.019 + 0.006 
for n = 0 is well understood. It is due to the measurement being per- 
formed modulo 8, which attributes n = 8 events (0.012 probability) to 
the n= 0 bin. The near-perfect agreement of the fit with the experi- 
ment provides a direct verification of the quantum postulate about the 
probabilities of measurement outcomes. 


NATURE| Vol 448/23 August 2007 


Repeated measurements and field jumps 


Repeatability is another fundamental feature of an ideal QND mea- 
surement. To test it, we follow the evolution of the field state along 
sequences made of ~2,900 atoms. We determine P)() and (n) up to 
N= 110. We then drop the first atom and replace it with the 111st 
one, resuming the calculation with a flat initial distribution and 
obtain a new (n). We repeat the procedure atom by atom. We thus 
decode continuously a single field history versus time. Measurements 
separated by more than T,,, exploit independent information. 

Figure 4a shows the evolution of (n) over 0.7s for the two 
sequences whose initial data are displayed in Fig. 2a. In each case, 
(n) evolves quickly towards an integer (5 or 7). This collapse is fol- 
lowed by a plateau, corresponding in these two realizations to ~2 
independent measurements. Eventually, cavity damping results in a 
photon loss: a quantum jump occurs, decreasing (7) by one. This 
event is recorded after a delay of a fraction of T,,, with respect to the 
real jump time, as several atoms are required after the quantum leap 
to build up the new photon number probability. Note in the inset of 
Fig. 4a that it takes about 0.01 s for the atoms to ‘realize’ that a jump 
has occurred. The staircase-like evolution of the field proceeds in this 
way down to vacuum. Figure 4b presents four other examples of 
signals following a collapse into n= 4. The randomness of the step 
durations is typical of quantum dynamics. In one of these recordings 
(leftmost panel), the n= 4 step lasts 0.235, corresponding to ~9 
independent QND measurements 

The lifetime of a n-Fock state, T./n at T = 0 K (ref. 39), is reduced 
by thermal effects to T./[n + n,(2n + 1)], that is, ~0.029 s, ~0.023 s 
and ~0.017 s for n = 4, 5 and 7, respectively. The statistical analysis of 
2,000 QND sequences, each of 0.7s duration, provides a detailed 
description of the dynamical evolution under cavity relaxation of 
Fock states with n up to 7 (J.B. et al., manuscript in preparation). 
The sequences of Fig. 4, in which some Fock states survive much 
longer than their lifetimes, are relatively rare events. We have selected 
them to demonstrate the ability of the QND procedure to generate 
and repeatedly measure large n-Fock states of radiation in a cavity. 


Beyond energy measurements 

Our QND source of Fock states operates in a way different from 
previous methods based on resonant®*’ or Raman‘! processes in 
cavity quantum electrodynamics, which have so far been limited to 
smaller photon numbers (n= 1 or 2). This QND measurement opens 
novel perspectives for the generation of non-classical states of light. If 
the initial photon number distribution spans a range of n values larger 
than 2q, the decimations induced by successive atoms do not distin- 
guish between n and n + 2q. The field then collapses into a superposi- 
tion of the form S74¢,+24|1+ 2q). For instance, ¢o|0) + c4/2q) 


Figure 4 | Repeated QND measurements. 


a, Mean photon number (7) followed over 0.7 s 
for the two sequences whose collapse is analysed 
in Fig. 2. After converging, (n) remains steady for 
a while, before successive quantum jumps bring it 
1 down to vacuum. Inset, zoom into the n = 5 to 4 
J jump, showing that it is detected in a time of 

J ~0.01 s. b, Four other signals recording the 
evolution of () after field collapse into n = 4. 
Note in the leftmost frame the exceptionally long- 


lived n = 4 state, and in the rightmost frame the 
n= 1 to 2 jump revealing a thermal field 
fluctuation. 
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represents a field coherently suspended between vacuum and 2q 
photons. This superposition of states with energies differing by many 
quanta is a new kind of ‘Schrédinger cat’ state of light. 

Superpositions of field states with the same amplitude but different 
phases—‘Schrédinger cats’ of a different kind—are also generated in 
this experiment. As the photon number converges, its conjugate vari- 
able, the field’s phase, gets blurred. After the first atom’s detection, the 
initial state collapses into a superposition of two coherent states with 
different phases*'**. Each of these components is again split into two 
coherent states by the next atom and so on, leading to complete phase- 
uncertainty when the photon number has converged*'. The evolution 
of the Schrédinger cat states generated in the first steps of this process 
could be studied by measuring the field Wigner function’. 
Decoherence*** of superpositions of coherent states containing many 
photons could be monitored in this way. 


METHODS SUMMARY 


The preparation and detection of circular Rydberg atoms, the cavity and the 
Ramsey interferometer are described elsewhere**®”’. The |+,) states are the 
circular Rydberg levels of rubidium with principal quantum numbers 51 and 
50 (transition frequency ~51.1 GHz). The theoretical phase shift per photon”®”® 
is Q77/26, where Q/27 = 50 kHz is the vacuum Rabi frequency at cavity centre, 
6/2r is the atom—cavity detuning and t= 3 X 10° s is the effective atom—cavity 
interaction time. It is defined as t= (1/2)? w/v, where w is the waist of the 
gaussian cavity field mode (w=6mm) and v the atomic velocity 
(v= 250ms_'). This effective time is obtained by averaging the spatial variation 
of the square of the atom-field coupling as the atom crosses the cavity mode””®. 

More details about the experimental settings, including the determination of 
the A and B parameters, the fine tuning of the phase shift per photon, ®, and the 
adjustment of the four phases of the Ramsey interferometer are given in the 
Methods section. We also analyse the adiabaticity of the atom—field coupling, 
which is an essential feature of our measurement. We describe the generation of 
sequences of atoms crossing the cavity one at a time with a well-defined velocity, 
and we discuss the effect of rare multi-atom events on the detection signals. We 
also explain why the sequences of detection directions ; occur randomly in a 
measuring sequence. We conclude by discussing alternative strategies to pin 
down the photon number non-destructively. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 


Received 2 May; accepted 28 June 2007. 


1. Wheeler, J. A. & Zurek, W. H. Quantum Theory and Measurement (Princeton Series 
in Physics, Princeton, New Jersey, 1983). 

2. Kuzmich, A., Mandel, L. & Bigelow, N. P. Generation of spin squeezing via 
continuous quantum non-demolition measurement. Phys. Rev. Lett. 85, 
1594-1597 (2000). 

3.  Geremia, J. M., Stockton, J. K. & Mabuchi, H. Suppression of spin projection noise 
in broadband atomic magnetometry. Phys. Rev. Lett. 94, 203002 (2005). 

4. Braginsky, V. B. & Vorontsov, Y. |. Quantum mechanical limitations in 
macroscopic experiments and modern experimental technique. Usp. Fiz. Nauk 114, 
41-53 (1974); Sov. Phys. Usp. 17, 644-650 (1975). 

5. Thorne, K. S., Drever, R. W. P., Caves, C. M., Zimmerman, M. & Sandberg, V. D. 

Quantum nondemolition measurements of harmonic oscillators. Phys. Rev. Lett. 

40, 667-671 (1978). 

6. Unruh, W. G. Analysis of quantum nondemolition measurement. Phys. Rev. D 18, 

764-1772 (1978). 

7. Caves, C. M., Thorne, K.S., Drever, R. W. P., Sandberg, V. D. & Zimmerman, M. On 

he measurement of a weak classical force coupled to a quantum mechanical 

oscillator. Rev. Mod. Phys. 52, 341-392 (1980). 

8. Milburn, G. J. & Walls, D. F. Quantum nondemolition measurements via quadratic 

coupling. Phys. Rev. A 28, 2065-2070 (1983). 

9. Peil, S.& Gabrielse, G. Observing the quantum limit of an electron cyclotron: QND 

measurement of quantum jumps between Fock states. Phys. Rev. Lett. 83, 

287-1290 (1999). 

10. Leibfried, D., Blatt, R., Monroe, C. & Wineland, D. J. Quantum dynamics of single 

rapped ions. Rev. Mod. Phys. 75, 281-324 (2003). 

Tl. Schmidt, P.O. etal. Spectroscopy using quantum logic. Science 309, 749-752 (2005). 

12. Kuzmich, A. et al. Quantum non-demolition measurements of collective atomic 

spin. Phys. Rev. A 60, 2346-2350 (1999). 

13. Grangier, P., Levenson, J. A. & Poizat, J. P. Quantum non-demolition 

measurements in optics. Nature 396, 537-542 (1998). 

14. Imoto, N., Haus, H. A. & Yamamoto, Y. Quantum nondemolition measurement of 

he photon number via the optical Kerr effect. Phys. Rev. A 32, 2287-2292 (1985). 


ARTICLES 


5. Grangier, P., Roch, J. F. & Roger, G. Observation of backaction-evading 
measurement of an optical intensity in a three-level atomic non-linear system. 
Phys. Rev. Lett. 66, 1418-1421 (1991). 

6. Roch, J. F. et al. Quantum nondemolition measurements using cold trapped 
atoms. Phys. Rev. Lett. 78, 634-637 (1997). 

7. Bencheikh, K., Levenson, J. A., Grangier, P. & Lopez, O. Quantum nondemolition 
demonstration via repeated backaction evading measurements. Phys. Rev. Lett. 
75, 3422-3425 (1995). 

8. Bruckmeier, R., Schneider, K., Schiller, S. & Mlynek, J. Quantum nondemolition 
measurements improved by a squeezed meter input. Phys. Rev. Lett. 78, 
1243-1246 (1997). 

9. Haroche, S. in Fundamental Systems in Quantum Optics (eds Dalibard, J. & 
Raimond, J. M.) 767-940 (Les Houches Summer School, Session LIII, North 
Holland, Amsterdam, 1992). 

20. Haroche, S. & Raimond, J. M. Exploring the Quantum: Atoms, Cavities and Photons 

(Oxford Univ. Press, Oxford, UK, 2006). 

21. Walther, H., Varcoe, B. T. H., Englert, B. G. & Becker, T. Cavity quantum 
electrodynamics. Rep. Prog. Phys. 69, 1325-1382 (2006). 

22. Birnbaum, K. M. et al. Photon blockade in an optical cavity with one trapped atom. 
Nature 436, 87-90 (2005). 

23. Wilk, T., Webster, S. C., Kuhn, A. & Rempe, G. Single-atom single-photon 
quantum interface. Science doi:10.1126/science.1143835 (published online 21 
June 2007). 

24. Nogues, G. et al. Seeing a single photon without destroying it. Nature 400, 
239-242 (1999). 

25. Varcoe, B. T. H., Brattke, S., Weidinger, M. & Walther, H. Preparing pure photon 
number states of the radiation field. Nature 403, 743-746 (2000). 

26. Raimond, J. M., Brune, M. & Haroche, S. Manipulating quantum 
entanglement with atoms and photons in a cavity. Rev. Mod. Phys. 73, 565-582 
(2001). 

27. Schuster, D. |. et al. Resolving photon number states in a superconducting circuit. 
Nature 445, 515-518 (2007). 

28. Kuhr, S. et al. Ultrahigh finesse Fabry-Pérot superconducting resonator. Appl. Phys. 
Lett. 90, 164101 (2007). 

29. Gleyzes, S. et al. Quantum jumps of light recording the birth and death of a photon 
in a cavity. Nature 446, 297-300 (2007). 

30. Brune, M., Haroche, S., Lefévre, V., Raimond, J. M. & Zagury, N. Quantum non- 
demolition measurement of small photon numbers by Rydberg atom phase- 
sensitive detection. Phys. Rev. Lett. 65, 976-979 (1990). 

31. Brune, M., Haroche, S., Raimond, J. M., Davidovich, L. & Zagury, N. Quantum non- 
demolition measurements and generation of Schrédinger cat states. Phys. Rev. A 
45, 5193-5214 (1992). 

32. Carmichael, H. An Open System Approach to Quantum Optics (Springer, Berlin, 
1993). 

33. Nielsen, M. A. & Chuang, |. L. Quantum Computation and Quantum Information 
(Cambridge Univ. Press, Cambridge, UK, 2000). 

34. Glauber, R. J. Coherent and incoherent states of the radiation field. Phys. Rev. 131, 
2766-2788 (1963). 

35. D'Ariano, G. M. & Yuen, H. P. Impossibility of measuring the wave function of a 
single quantum system. Phys. Rev. Lett. 76, 2832-2835 (1996). 

36. Gisin, N., Ribordy, G. G., Tittel, W. & Zbinden, H. Quantum cryptography. Rev. 
Mod. Phys. 74, 145-195 (2002). 

37. Ramsey, N. Molecular Beams (Oxford Univ. Press, Oxford, UK, 1984). 

38. Bayes, T. An essay towards solving a problem in the doctrine of chances. Phil. 
Trans. R. Soc. Lond. 53, 370-418 (1763). 

39. Lu, N. Effects of dissipation on photon statistics and the lifetime of a pure number 
state. Phys. Rev. A 40, 1707-1708 (1989). 

40. Maitre, X. et al. Quantum memory with a single photon in a cavity. Phys. Rev. Lett. 

79, 769-772 (1997). 

41. Bertet, P. et al. Generating and probing a two-photon Fock state with a single atom 

in a cavity. Phys. Rev. Lett. 88, 143601 (2002). 

42. Brune, M. et al. Observing the progressive decoherence of the meter in a quantum 

measurement. Phys. Rev. Lett. 77, 4887-4890 (1996). 

43. Bertet, P. et al. Direct measurement of the Wigner function of a one photon Fock 

state in a cavity. Phys. Rev. Lett. 89, 200402 (2002). 

44. Giulini, D., et al. Decoherence and the Appearance of a Classical World in Quantum 

Theory (Springer, Berlin, 1996). 

45. Zurek, W. H. Decoherence, einselection and the quantum origins of the classical. 

Rev. Mod. Phys. 75, 715-775 (2003). 


Acknowledgements This work was supported by the Agence Nationale pour la 
Recherche (ANR), by the Japan Science and Technology Agency (JST), and by the 
EU under the IP projects SCALA and CONQUEST. C.G. and S.D. were funded by the 
Deélégation Générale a l'Armement (DGA). J.-M.R. is a member of the Institut 
Universitaire de France (IUF). 


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. The authors declare no competing financial interests. 
Correspondence and requests for materials should be addressed to M.B. 
(brune@lkb.ens.fr) or S.H. (haroche@lkb.ens.fr). 


893 


©2007 Nature Publishing Group 


doi:10.1038/nature06057 


METHODS 

Experimental settings. To calibrate the Ramsey interferometer, we send atoms 
prepared in |—.) across the set-up, with C empty. We measure the probability 
P(j = 0,@) for detecting the atom in |+-,) as a function of the relative phase 
between R, and R;. From the Ramsey fringes, we obtain the phase origin ¢ = 0 
corresponding to a detection along the |+9) spin direction. 

Tuning of 6 is performed by moving the cavity mirrors with piezoelectric 
actuators. In theory, a 7/4 phase shift corresponds to 6/2n = 300 kHz. We set 
the detuning close to this value (with a 15 kHz uncertainty due to imperfect 
knowledge of the atomic transition frequency affected by small residual Stark 
and Zeeman shifts). With this detuning, the photon-induced phase shift is, 
within 3%, a linear function of n for n= 0 to 7. 

By Stark-shifting the atomic spin phase with a short electric field pulse 
applied just before R, and adjusted to different amplitudes, we translate the 
fringes and set ¢ to four different values, close to It/4 (I= —2, —1, 0 and 1). 
After preliminary settings of ® and ¢;, we refine our calibration. We inject a 
small coherent field in C (mj = 1.2). This field has a negligible probability 
for n> 4. For each $j, we record the fraction 170(@;) of spins found in |+) on 
a sequence of atoms crossing C in a time short compared to T,.. Repeating 
the sequence many times, we find distributions of 1o(j)s, which we fit 
as a sum of five peaks centred on the discrete values equal to 
POP) 0,p:|n) = [A+ Bcos(n® — ¢;)]/2, with n=0 to 4. From a best fit of 
the 70 distributions, we get the values A = 0.907 + 0.004, B= 0.674 + 0.004, 
P/n = 0.233 + 0.004, 6/1 0.464 + 0.013, —0.229 + 0.009, —0.015 + 0.007 
and +0.261 + 0.006 (i= a, b, c d). These values are inserted in equation (2). The 
fringe contrast B is reduced below 1 by experimental imperfections (stray fields, 
detection errors, two-atom events, see below). The other six parameters A, ®/t 
and ¢;/n (i= a, b, c, d) are close to their ideal values (1, %, —'2, —Y%, 0, “, 
respectively). 

Adiabaticity. The adiabatic variation of the coupling as the atoms cross the 
gaussian profile of the cavity mode keeps the atomic emission rate extremely 
low. The theoretical probability that an atom deposits an additional photon 
when the cavity contains a coherent field with 1g = 3.82 is below 1.3 X 10°. 
Consistent with this very small value, we found on analysing our experimental 
data that the average number of photons deposited in C by a sequence of ~2,900 
atoms is negligible compared to 1. 

Atomic sequences. They are realized by pulsing the Rydberg atom preparation. 
The atoms are excited from a thermal atomic beam, velocity selected by optical 
pumping, at a rate of 1.4X10* pulses per second. The velocity spread, 
Av=+1ms | around 250ms ‘|, has a negligible effect on the Ramsey fringe 
contrast. In order to limit the number of events with two atoms per pulse, the 
intensity of the exciting lasers is kept low (average number of detected Rydberg 
atoms per pulse is 0.3, detection efficiency 50%). Undetected atoms do not affect 
the photon number distribution. A single atom is counted in 22% of the pre- 
paration pulses, while 3% of them contain a detected atom pair. When two 
atoms (whether detected or not) cross C together there is a slight reduction of 
the interferometer contrast, owing to small cavity-mediated interactions*® 
between the atoms. This reduction is taken into account in the measured B value. 
All detected events with one or two atoms per pulse are compiled independently 
in the data analysis. When three atoms are in C together the fringe contrast is 
strongly reduced, but the probability of these events is small (2% probability for 
preparing 3 atoms or more per pulse). 

Randomness of detection directions. The interferometer phase is changed from 
pulse to pulse, going cyclically from , to @4. As the presence of one (or two) 
atoms ina given pulse is random, we cannot predict which phase will correspond 
to the next observed atom. We acquire this knowledge by detecting the atom, and 
i(k) is thus a randomly measured variable. 

Other QND measurement strategies. Efficient photon number decimation 
could be obtained by alternative methods. As suggested in refs 30 and 31, we could 
change the atom—cavity interaction time (and hence the phase shift per photon) by 
detecting randomly atoms from a thermal atomic beam, without velocity selec- 
tion. The optimal data acquisition procedure consists in applying to successive 
atoms a sequence of 1, 1/2,7/4 .... phase shifts per photon, while adjusting ¢ for 
each spin, based on the result of the previous measurement. This expresses 7 in 
binary code, each atom providing a bit of information”**’. The required number 
of atoms per measuring sequence is then minimal, equal to the smallest integer 
= logs(Mnax + 1). This ideal strategy requires however a deterministic beam of 
atoms, with perfect Ramsey fringe contrast and 100% detection efficiency. 
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Cortico-striatal synaptic defects and OCD- 
like behaviours in Sapap3-mutant mice 


Jeffrey M. Welch'*, Jing Lu’**, Ramona M. Rodriguiz’, Nicholas C. Trotta’, Joao Peca’”, Jin-Dong Ding®, 
Catia Feliciano’, Meng Chen®, J. Paige Adams®, Jianhong Luo’, Serena M. Dudek’, Richard J. Weinberg®, 


Nicole Calakos’*, William C. Wetsel* & Guoping Feng’ 


Obsessive-compulsive disorder (OCD) is an anxiety-spectrum disorder characterized by persistent intrusive thoughts 
(obsessions) and repetitive actions (compulsions). Dysfunction of cortico-striato-thalamo-cortical circuitry is implicated in 
OCD, although the underlying pathogenic mechanisms are unknown. SAP90/PSD95-associated protein 3 (SAPAP3; also 
known as DLGAP3) is a postsynaptic scaffolding protein at excitatory synapses that is highly expressed in the striatum. Here 
we show that mice with genetic deletion of Sapap3 exhibit increased anxiety and compulsive grooming behaviour leading to 
facial hair loss and skin lesions; both behaviours are alleviated by a selective serotonin reuptake inhibitor. 
Electrophysiological, structural and biochemical studies of Sapap3-mutant mice reveal defects in cortico-striatal synapses. 
Furthermore, lentiviral-mediated selective expression of Sapap3 in the striatum rescues the synaptic and behavioural 
defects of Sapap3-mutant mice. These findings demonstrate a critical role for SAPAP3 at cortico-striatal synapses and 
emphasize the importance of cortico-striatal circuitry in OCD-like behaviours. 


OCD, a common and incapacitating psychiatric disorder, affects 
~2 per cent of the world population’. OCD is characterized by 
persistent intrusive thoughts (obsessions), repetitive actions (com- 
pulsions) and excessive anxiety. Clinical expression of OCD is het- 
erogeneous in the types of obsessions and compulsions, heritability 
and co-morbid conditions, probably reflecting heterogeneity in the 
underlying pathology’. In addition, there are many disorders that 
share features with OCD, termed ‘OC-spectrum disorders’, which 
include Tourette’s syndrome, trichotillomania and body dysmorphic 
disorder. 

The neurobiological basis of OCD is unclear. However, lesions, 
functional neuro-imaging, and neuropsychological studies have 
indicated that the cortico-striato-thalamo-cortical circuitry may 
have a key role in the pathogenesis of OCD**. Familial studies of 
OCD indicate that the risk to first degree relatives is 3-12 times 
greater than the general population, similar to familial risk rates 
observed for bipolar disorder and schizophrenia®*. In addition, con- 
cordance for OCD is greater among pairs of monozygotic (80-87%) 
than dizygotic (47-50%) twins”"®. However, no genetic factors have 
yet been identified as a cause of OCD, presumably reflecting aetio- 
logical heterogeneity within the disorder. Although many of the 
symptoms of OCD can be ameliorated by enhancing serotonin 
neurotransmission, it is not clear at the cellular level whether defects 
in the serotonergic system are the primary cause*”. Indeed, dopami- 
nergic and glutamatergic neurotransmitter systems have also been 
implicated*"'. Here we show that targeted deletion of Sapap3 in mice 
leads to a behavioural phenotype similar to OCD: compulsive over- 
grooming behaviour, increased anxiety, and response to selective 
serotonin reuptake inhibitors. In these mutant mice, we have iden- 
tified defects at cortico-striatal synapses, part of the circuitry impli- 
cated in OCD, and we show that selective expression of Sapap3 in the 


striatum rescues the synaptic and behavioural defects. These findings 
indicate that defects in excitatory transmission at cortico-striatal 
synapses may underlie some aspects of OCD. 


Sapap3~’~ mice exhibit self-inflicted facial lesions 


SAPAP family proteins were originally identified as postsynaptic 
density (PSD) components that interact with the PSD95 and Shank 
families of proteins'*'’, two other multi-domain postsynaptic scaf- 
folding proteins at excitatory synapses. Together, these three groups 
of proteins are thought to form a key scaffolding complex that reg- 
ulates the trafficking and targeting of neurotransmitter receptors and 
signalling molecules to the postsynaptic membrane of excitatory 
synapses'*"'®. There are four highly homologous genes encoding 
members of the SAPAP family'’. Of these, Sapap1, 3 and 4are highly, 
but differentially, expressed in several regions of the brain’”’’. 
Notably, Sapap3 is the only member highly expressed in the striatum. 
To facilitate study of the in vivo function of SAPAP proteins at 
synapses, we generated knockout mice for the Sapap3 gene using 
homologous recombination in mouse embryonic stem cells (Supple- 
mentary Fig. la—d). Mice homozygous for the Sapap3 deletion 
(Sapap3 ‘~) were born at the expected mendelian rate, grew to 
adulthood with body weights similar to wild-type mice, and were 
fertile. Anatomical and histological analyses of brain showed that 
Sapap3 '~ mice were grossly normal (data not shown). 

By the age of 4-6 months, however, Sapap3 ‘~ mice developed 
lesions on their head, neck and snout regions (Fig. 1a). This pheno- 
type was 100% penetrant. Lesions were usually first noticed as a patch 
of hairless skin under the eyes, or swelling of the snout, progressing to 
relatively symmetric bilateral lesions encompassing large parts of 
the neck and head. Sapap3 ‘~ mice developed lesions regardless of 
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whether they were housed alone or with cage mates. Lesions were not 
observed on wild-type or heterozygous mice even when housed in 
the same cage with Sapap3 ‘~ mice from birth, thereby excluding the 
possibility that lesions were caused by allogrooming or were the 
result of aggressive encounters with other Sapap3 ‘~ mice. In fact, 
Sapap3 ‘~ mice were not observed to behave aggressively, but were 
often seen engaged in self-grooming whether they were housed alone 
or in groups. 

Given the obvious lesions on the Sapap3 '~ mice, we tested the 
possibility that their lesions might be caused by peripheral cutaneous 
defects, such as inflammation or abnormal afferent sensation. We 
examined facial skin from pre-lesion Sapap3 ‘~ mice that exhibited 
increased grooming. Histological analysis of skin did not reveal any 
anatomical differences among wild-type, Sapap3*'~ and Sapap3 '~ 
mice, and no lymphocytic or granulocytic infiltration was observed 
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Figure 1 | Facial lesions, excessive grooming and anxiety-like behaviours in 
Sapap3-mutant mice. a, Sapap3 ‘ mice have facial and neck skin lesions 
(arrows). b, ¢, Sapap3 ' ~ mice (KO) showed more grooming bouts (b) and 
spent more time in self-grooming (c) than wild-type mice (WT) at all times 
examined. d, Pre-lesion and post-lesion groups of Sapap3 '~ mice showed 
similar degrees of increased grooming. e, f, In the open field test, Sapap3 ‘~ 
mice spent less time in the centre (e), whereas locomotion along the 
perimeter was comparable to wild-type controls (f). g—i, In the dark-light 
emergence test, Sapap3 '~ mice took longer to cross from the dark to the 
brightly lit chamber (g) and spent less time in the brightly lit chamber 

(h), whereas total activity in both chambers was similar (i). j-I, In the 
elevated zero maze, Sapap3 '~ mice took longer to cross into the open areas 
(j) and spent less time in these areas than wild-type controls (k), whereas 
total activity in both open and closed areas was similar (I). *P < 0.05, 

**P < 0.01, ***P < 0.001, repeated measures ANOVA for b-e and two-tailed 
t-test for f-I; all data are presented as means ~ s.e.m. from 8—9 mice per 
genotype; Cohen’s kappa for intra-observer agreement exceeded 0.92. 


ARTICLES 


(Supplementary Fig. le, f). We did, however, find lymphocytic/ 
granulocytic infiltration in skin with lesions, probably owing to 
injury and infection (Supplementary Fig. 1g). No differences were 
detected in sensory innervation among wild-type, Sapap3‘/~ and 
Sapap3 ‘~ mice; hair-nerve end organs, lobular corpuscle-like nerve 
endings, and free epidermal nerve endings were present and not 
different between Sapap3 ‘~ and control mice (Supplementary 
Fig. 1h, i). Thus, our examination revealed no obvious peripheral 
defects that would indicate a cause for the lesions. Together, these 
findings raised the possibility that Sapap3'~ mice have excessive 
and injurious self-grooming behaviour. 


OCD-like behaviours of Sapap3~’~ mice 


To determine whether Sapap3 '~ mice groom excessively, we con- 
tinuously videotaped habituated, individually housed mice for 24h. 
Isolated Sapap3 '~ mice showed dramatically increased grooming 
bouts and spent significantly more time self-grooming than wild- 
type litter mates (Fig. 1b, c). The increased grooming was present 
throughout the day, including the period when mice usually sleep 
(10:00-14:00). Both mutant mice with facial lesions and mice yet to 
develop lesions showed similar degrees of increased grooming 
(Fig. 1d), indicating that the lesion itself was not the cause of 
increased grooming. We thus conclude that Sapap3 ‘~ mice have 
excessive and injurious levels of self-grooming, a phenotype remin- 
iscent of compulsive behaviours. 

Because the Sapap3 '~ mice exhibited a compulsive-like beha- 
viour, we considered whether their phenotypes further resemble 
OC-spectrum disorders. We therefore examined other OCD-like 
behaviours that are measurable in mice, such as increased anxiety 
and response to typical pharmacological agents. To test whether 
Sapap3 ‘~ mice exhibit increased anxiety, we conducted open field, 
dark-light emergence, and elevated zero maze tests. In the open field, 
mice with anxiety-like phenotypes tend to stay along the walls, avoid- 
ing the centre zone. Sapap3 ‘~ mice spent much less time exploring 
the centre area than wild-type litter mates (Fig. le). However, activ- 
ities along the walls and corners, areas believed to be less stressful, 
were not different between wild-type and Sapap3 ‘~ mice (Fig. If). 
In the dark—light emergence test, the latency to cross from a dark into 
a brightly lit chamber (a stressful environment), and the time spent in 
the brightly lit chamber were examined. Sapap3 '~ mice took longer 
to cross from the dark to the lit chamber and spent less time in the lit 
chamber than wild-type controls (Fig. 1g, h), even though total activ- 
ity in the chambers was similar between wild-type and Sapap3 '~ 
mice (Fig. li). Finally, in the elevated zero maze test, we found that 
Sapap3 ‘~ mice took longer to cross into the open areas (riskier 
environment) and spent significantly less time exploring the open 
areas compared with wild-type controls (Fig. 1j, k), whereas total 
activity was not different between the two genotypes (Fig. 11). 
Together, these results indicate that Sapap3 ‘~ mice have an anxiety- 
like phenotype. 

We next evaluated whether drugs used to treat OCD would be 
effective in reducing the abnormal behaviour in Sapap3 ‘~ mice. 
Selective serotonin reuptake inhibitors are a first-line treatment for 
OCD; although their mechanism of action is not well understood, 
these drugs alleviate symptoms in approximately 50% of OCD 
patients’. Fluoxetine treatment for 6 days (once per day at 5mgkg ', 
intraperitoneally) did not affect grooming behaviour of wild-type 
mice. However, this treatment significantly reduced the excessive 
grooming in Sapap3 ‘~ mice (Fig. 2a). Furthermore, fluoxetine 
dramatically reduced anxiety-like behaviours in Sapap3 ‘~ mice in 
the dark—light emergence test without affecting their total activity 
(Fig. 2b, c). In contrast, a single injection of fluoxetine (5mgkg"', 
intraperitoneally) had no effects on their excessive grooming 
behaviour (Fig. 2d). Thus, Sapap3‘~ mice exhibit compulsive-like 
grooming and increased anxiety, both of which are alleviated by 
6 days of fluoxetine treatment. 
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Figure 2 | Fluoxetine treatment alleviates excessive grooming and anxiety- 
like behaviour. a, Daily fluoxetine treatment over six days reduced 
grooming behaviour in Sapap3 ‘~ mice. b, c, Given over 6 days, fluoxetine 
alleviated anxiety-like behaviour of Sapap3 ‘~ mice in the dark—light 
emergence test (b) without affecting their total activity (c). d, A single 
injection of fluoxetine had no effect on the grooming behaviour of 

Sapap3 '~ mice. **P < 0.01, ***P < 0.001, repeated measures ANOVA; all 
data are presented as means ~ s.e.m. from 9-11 mice per group; Cohen’s 
kappa for intra-observer agreement exceeded 0.92. 


Altered cortico-striatal synaptic transmission 


The observations that SAPAP3 is the only member of the SAPAP 
family highly expressed in the striatum (Fig. 3a) and_ that 
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may be defects in striatal neurotransmission in Sapap3 '~ mice. 
Because SAPAPs are postsynaptic proteins of excitatory synapses'””” 
and directly bind PSD95 family proteins, which are known to regu- 
late the trafficking of both AMPA- and NMDA-type glutamate 
receptors (AMPAR and NMDAR, respectively)'?’', we focused on 
cortico-striatal synapses, which constitute the large majority of glu- 
tamatergic synapses in the striatum. We performed extracellular 
recordings from acute striatal brain slices. A field recording electrode 
was placed in the dorso-lateral striatum and a stimulating electrode 
was placed nearby in the corpus callosum. Recordings were obtained 
in the presence of picrotoxin, a GABA (y-aminobutyric acid ) 4 recep- 
tor antagonist, to block contaminating responses from intra-striatal 
GABAergic circuitry. We found that field excitatory postsynaptic 
potentials (fEPSPs) were significantly reduced in Sapap3 '~ mice 
compared with wild-type litter mates (Fig. 3b). Axonal excitability 
and presynaptic function were not different from wild type, as indi- 
cated by the slope of the input-output curves normalized to the peak 
response, the relationship of stimulation intensity to the amplitude of 
the action potential component (NP1; Supplementary Fig. 2a, b), and 
paired-pulse ratios (Fig. 3c), indicating that the reduction in total 
field responses was probably due to a postsynaptic impairment in 
synaptic transmission. 

We next evaluated NUDAR-dependent responses by recording in 
the presence of an AMPAR antagonist (NBQX) and the NMDAR co- 
factor glycine, in the absence of magnesium. Surprisingly, in contrast 
to the reduction we observed in the total fEPSP amplitude, which 
is dominated by AMPAR transmission under these conditions, 
the NMDAR-dependent fEPSPs were elevated in Sapap3 '~ mice 
(Fig. 3d, e). These findings indicate a differential effect of Sapap3 
deletion on extracellular field potentials that are dependent on 
AMPAR and NMDAR activity. We further tested whether the syn- 
aptic defects are unique to the striatum by examining synaptic trans- 
mission in the hippocampus, in which Sapap1, 3 and 4 are highly 
expressed’”'*. We found no defects in CA3—CA1 hippocampal basal 
synaptic transmission in Sapap3 '~ mice (Supplementary Fig. 2c, d), 
probably reflecting functional redundancy or compensation by other 
members of the SAPAP family. Together, these findings identify an 
important role for SAPAP3 in postsynaptic glutamatergic synaptic 
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Figure 3 | Altered cortico-striatal synaptic transmission in Sapap3-mutant 
mice. a, Only Sapap3 messenger RNA is highly expressed in the striatum 
(STR). b, Cortico-striatal field potential recordings of acute brain slices 
show decreased fEPSPs in Sapap3 ‘~ mice. P< 0.001, repeated measures 
ANOVA. Inset shows a typical field recording (black trace) and sensitivity to 
NBQX (red trace). NP1, negative peak 1. c, Paired-pulse ratios (slope of 
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fEPSP no. 2/slope fEPSP no. 1) are similar between wild-type and Sapap3 /~ 
mice. d, Example traces of NUDAR-dependent fEPSPs recorded in the 
presence of 50 tM NBQX, 0 mM Mg”* and 2 mM glycine. Sensitivity to APV 
is indicated by the red trace. e, NMDAR fEPSP area is increased in 
Sapap3 '~ mice. *P <0.05, two-tailed t-test; number of recordings in 
parentheses. All data are presented as means + s.e.m. 
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Altered NMDAR composition of the PSD in striatum 


To investigate the potential role of SAPAP3 in postsynaptic assembly, 
we examined the levels of a select group of PSD proteins that directly 
or indirectly interact with SAPAPs, using biochemically purified PSD 
fractions from the striatum of wild-type and Sapap3 '~ mice. We 
found no significant differences in the levels of PSD95, PSD93 or 
Shank proteins in the striatal PSD between the genotypes (Fig. 4a, 
b). However, the level of NRI, the obligatory subunit of the NMDAR, 
was significantly increased (Fig. 4a, b). This result is consistent with 
our data from extracellular recordings in the striatum, which showed 
increased NMDAR field potential responses. Although we cannot 
exclude the possibility that the larger NMDAR-dependent fEPSPs 
include a contribution from activated conductances other than 
NMDARs (a limitation of extracellular recordings), our electro- 
physiological data in combination with these biochemical findings 
indicate that an increase in NMDARs in the PSD may result in 
increased synaptic NMDAR activity in Sapap3'~ mice. Inter- 
estingly, we also found that the level of NR2B, the ‘juvenile’ subunit 
of the NMDAR, was significantly increased in the PSD of Sapap3 '~ 
mice, whereas NR2A, the ‘adult’ subunit, was significantly reduced 
(Fig. 4a, b). In contrast, we found no significant changes in the overall 
levels of NR1, NR2A and NR2B subunits in the total striatal lysate 
(Supplementary Fig. 3a, b), suggesting that the changes in these sub- 
units occur exclusively at the PSD. Synaptic NMDARs are known to 
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Figure 4 | Structural and biochemical analyses of cortico-striatal synapses 
in Sapap3-mutant mice. a, b, The levels of PSD95 (DLG4), PSD93 (DLG2) 
and Shank (SHANK1-3) in striatal PSD fractions are not affected in 
Sapap3 '~ mice. The levels of NR1 (GRIN1) and NR2B (GRIN2B) subunits 
are increased, whereas that of NR2A (GRIN2A) is decreased. B-actin and 
B-tubulin were used as loading controls. *P < 0.05, two-tailed t-test. All data 
are presented as means = s.e.m. ¢, d, Electron micrographs show the 
presence of synaptic vesicles (arrowheads), postsynaptic densities (arrows) 
and dendritic spines (asterisks). e, The length of the PSD is not significantly 
different in wild-type and Sapap3 ‘~ mice. f, g, The thickness of the dense 
layer (f) of the PSD in Sapap3 ‘~ mice, but not the light layer (g), is reduced. 
P<0.001, two-tailed t-test; n = 94 for wild type and n = 92 for Sapap3 /~ 
mice. 
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undergo a developmental switch from predominantly NR1/NR2B to 
NRI/NR2A or NR1/NR2A/NR2B during the first two postnatal 
weeks of life’? ’. Although the underlying molecular mechanisms 
of this developmental switch are not well understood, postsynaptic 
scaffolding proteins, including the PSD95 and SAPAP family of pro- 
teins, have been implicated in this process***'. Our data indicate that 
the NMDAR subunit composition at cortico-striatal synapses of 
adult Sapap3 ‘~ mice is similar to that of wild-type juvenile mice”””’, 
indicating that SAPAP3 may play an important part in synaptic 
NMDAR subunit switching and maturation at cortico-striatal 
synapses. 


Structure of cortico-striatal synapses in Sapap3’~ mice 


To determine whether these electrophysiological and biochemical 
defects are accompanied by morphological changes, we examined 
the spine density of medium spiny neurons in the striatum using 
both Golgi staining and Dil filling by Diolistics. We found similar 
spine densities on medium spiny neurons in Sapap3 '~ and wild- 
type mice at postnatal day (P)21 (Supplementary Fig. 3c—e) and in 
adults (data not shown), indicating that spine formation and main- 
tenance were not affected by the lack of SAPAP3 proteins. 

We next examined whether deletion of Sapap3 affects PSD ultra- 
structure at cortico-striatal synapses using electron microscopic 
analysis to measure the length and thickness of the PSD in the stria- 
tum (Fig. 4c, d). We found no significant difference in length of the 
PSD between Sapap3 ‘~ and wild-type mice (Fig. 4e). The PSD is 
reported to exhibit a laminar organization: NMDARs, scaffolding 
proteins and signalling proteins reside in the electron-dense layer 
near the synaptic membrane, whereas proteins linked to trafficking 
and the actin cytoskeleton reside in the filamentous fringe towards 
the cytoplasm”. Accordingly, we independently measured the thick- 
ness of the ‘dense layer’ and the fringe ‘light layer’. We found a small 
but significant reduction in thickness of the dense layer but not the 
light layer (Fig. 4f, g) of the PSD in striatal synapses in Sapap3 /~ 
mice, suggesting a subtle defect in the structure of the postsynaptic 
complex. This finding is consistent with the proposed role of SAPAP3 
as a postsynaptic scaffolding protein at excitatory synapses'””’. 


Rescue by striatal-specific expression of Sapap3 


Our findings of altered cortico-striatal synaptic function and struc- 
ture in Sapap3-mutant mice, along with pre-existing evidence for the 
role of striatal circuitry in the pathogenesis of OC-spectrum disor- 
ders led us to ask whether loss of SAPAP3 in the striatum was critical 
for the phenotypes of Sapap3 '~ mice. We therefore investigated 
whether selective expression of SAPAP3 in the striatum was sufficient 
to rescue the phenotypes of Sapap3 ‘~ mice. We generated lentiviral 
vectors that express either green fluorescent protein (GFP) alone or a 
GFP-SAPAP3 fusion protein. GFP was fused to SAPAP3 at the amino 
terminus, a region with no known protein-protein interaction sites 
or motifs. As expected, GEP-SAPAP3 fusion proteins properly loca- 
lized to synapses when transfected into cultured neurons (Supple- 
mentary Fig. 4a). 

Lentiviruses expressing either GFP or GEP-SAPAP3 were deliv- 
ered to the striatum of Sapap3 ‘~ mice by bilateral microinjections at 
P7. To minimize injury and maximize lentiviral infection, we gave 
injections at one anterior and one posterior site per hemisphere and 
delivered lentivirus to 8 locations per hemisphere by injection of 
lentivirus along the needle path (Fig. 5a; see Methods). Sustained 
expression of GFP and GFP—SAPAP3 was observed in both dorsal 
and ventral striatum in post-mortem examination of brains at the 
end of the study (Fig. 5b-e, and Supplementary Fig. 4b, c). 

At 4-6 months after injection, behavioural analyses were 
conducted to assess grooming, facial lesions and anxiety-like beha- 
viours. When examined by continuous videotaping, excessive 
grooming was markedly reduced in Sapap3 ‘~ mice injected with 
lentivirus encoding GFP—SAPAP3 in comparison to Sapap3 /~ mice 
injected with the control GFP lentivirus (Fig. 5f). Moreover, facial 
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lesion severity was also significantly reduced in Sapap3‘~ mice 
injected with GFP-Sapap3 lentivirus (Fig. 5g, h). All eight 
Sapap3 ‘~ mice injected with the control GFP lentivirus developed 
open skin lesions (six with large lesions, one with medium, and one 
with small open wounds). In contrast, only three of the eight 
Sapap3 ‘~ mice injected with GFP-SAPAP3 developed open skin 
lesions (one with large, one with medium, and one with small open 
wounds), whereas two had small hairless patches and three had nor- 
mal skin conditions (Fig. 5i). Furthermore, Sapap3 ‘~ mice injected 
with lentivirus encoding GFP—SAPAP3 also showed reduced anxiety- 
like behaviour in the dark—light emergence test (Fig. 5j, k), with no 
change in total activity (data not shown). Finally, lentiviral-mediated 
expression of GFP—SAPAP3 in the striatum also rescued defects in 
cortico-striatal synaptic transmission in Sapap3 ‘~ mice (Fig. 51, m). 
Together, these data show that early postnatal expression of GFP- 
SAPAP3 selectively in the striatum is sufficient to alleviate the major 
manifestations of the OCD-like phenotype in Sapap3 ‘~ mice. 


Discussion 


Although dysfunction of cortico-striato-thalamo-cortical circuitry 
has been widely implicated in OC-spectrum disorders, the nature 
of this dysfunction remains unclear. Our study of Sapap3-mutant 
mice has identified an important role for SAPAP3 at cortico-striatal 
synapses and, although there are inherent limitations of evaluating 
thought content in mice, our study further indicates that cortico- 
striatal synaptic defects may be central to the genesis of OCD-like 
behaviours. These conclusions are supported by several lines of evid- 
ence. First, Sapap3 ‘~ mice exhibit OCD-like behaviours: excessive 
time spent performing a ritualistic action to the point of being 
self-injurious, increased anxiety-like behaviours, and response of 
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these manifestations to a selective serotonin reuptake inhibitor. 
Second, SAPAP3 protein is localized to excitatory, but not inhibitory, 
synapses’*. Third, Sapap3 is the only member of the Sapap family that 
is highly expressed in the striatum. Fourth, Sapap3-mutant mice have 
reduced cortico-striatal synaptic transmission and an NMDAR sub- 
unit composition suggestive of immature cortico-striatal synapses. 
Finally, selective expression of Sapap3 in the striatum of Sapap3 '~ 
mice rescues the synaptic defects and OCD-like behaviours. 

Anxiety is attributed primarily to dysfunction of the amygdala and 
ventral hippocampus. Interestingly, SAPAP3, unlike SAPAP1 and 
SAPAP4, is not highly expressed in the amygdala (Supplementary 
Fig. 5a—d), whereas all four members of the SAPAP family are 
expressed in the ventral hippocampus (Supplementary Fig. 5e-h), 
suggesting that anxiety-like behaviour in Sapap3 '~ mice is unlikely 
to be due to the lack of SAPAP3 in amygdala or ventral hippocampus. 
The unique expression of SAPAP3 in the striatum and the alleviation 
of the anxiety-like behaviour by lentiviral-mediated expression of 
GFP-SAPAP3 in the striatum suggest that the anxiety-like behaviour 
in Sapap3 ‘~ mice originates from striatal defects. Because the amyg- 
dala projects to the striatum, it is also possible that some functions 
of amygdalar-striatal projections are impaired in Sapap3 '~ mice. 
Additionally, our video surveillance data indicate that Sapap3- 
mutant mice have a disrupted sleep pattern (Supplementary Fig. 
6), which is alleviated by fluoxetine treatment (Supplementary 
Fig. 7). This sleep disturbance may also contribute to the anxiety-like 
behaviour in Sapap3 ‘~ mice. 

A principal role of the striatum is to integrate the various inputs 
arriving from the cortex and use this information to select certain 
motor and/or cognitive programs (decision making), which are sub- 
sequently carried out through the direct and indirect pathways of 
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Figure 5 | Lentiviral-mediated rescue of behavioural and synaptic defects in 
Sapap3-mutant mice. a, Diagram showing the approximate locations of 
microinjections in the striatum of Sapap3 '~ mice. Injection site 1, locations 
1-3 are more anterior than injection site 2, locations 4-8. b—c, Brain sections 
froma Sapap3 ‘~ mouse injected with GFP lentivirus show GFP fluorescence 
(b, ¢, upper panels) and an absence of SAPAP3 staining (b, ¢c, lower panels). 
d-e, Brain sections from a Sapap3 ‘~ mouse injected with lentivirus 
encoding GFP-SAPAP3 show both GFP fluorescence (d, e, upper panels) and 
SAPAP3 immunostaining (d, e, lower panels). f, Compared with Sapap3 '~ 
mice with GEP, Sapap3 '~ mice with GEP-SAPAP3 showed significantly 
reduced over-grooming behaviour. **P < 0.01, two-tailed t-test; n = 8 mice 
per group for, i-k. g,h, Sapap3 ‘~ mice with GFP-SAPAP3 (h) had reduced 
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severity of facial lesions when compared to Sapap3 '~ mice with GFP 

(g). i, Semi-quantitative lesion scores. **P < 0.01, Mann-Whitney U test. 

j, k, Reduced anxiety-like behaviours in Sapap3 ‘~ mice injected with 
GFP-Sapap3 lentivirus in the dark—light emergence test, including decreased 
latency to cross from the dark to light chamber (j) and increased time in the 
light chamber (k). *P < 0.05, two-tailed t-test. I, m, Field recordings from 
infected striatal areas of P21-P25 Sapap3 '~ mice with GEP-SAPAP3 
showed an increase in cortico-striatal fEPSP amplitude (I) and a reduction of 
NMDAR-dependent fEPSP area (m). P< 0.001, repeated measures ANOVA 
for |; *P < 0.05, two-tailed t-test for m; 1 is 12 and 10 for I, and 10 and 9 for 
m for GFP-injected and GFP-Sapap3 injected mice, respectively. All data are 
presented as means + s.e.m. 
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the basal ganglia*. A prominent model for OCD is that an activity 
imbalance between the direct and indirect pathways leads to beha- 
vioural abnormalities of OCD®*, and recent studies have shown that 
the two pathways are differentially regulated at cortico-striatal 
synapses***°. We have identified defects in both AMPAR- and 
NMDAR-dependent synaptic transmission at cortico-striatal syna- 
pses. These data are from extracellular field recordings, representing a 
population average of the effects on individual neurons. If the 
observed effects are differentially manifested by medium spiny neu- 
rons in the direct or indirect pathway, an imbalance in activity could 
be readily envisioned. 

Studies of the mechanisms of OC-spectrum disorders have mostly 
focused on the serotonergic and dopaminergic systems. However, 
recent genetic association studies of OCD in humans have implicated 
genes important for glutamatergic neurotransmission** **. Together 
with our study of SAPAP3 in mice, these observations raise the pos- 
sibility that defects in excitatory synaptic transmission in the cortico- 
striatal circuit may contribute to the pathogenesis of OC-spectrum 
disorders in humans. 


METHODS SUMMARY 

Behavioural analysis. Sapap3-knockout mice were generated by homologous 
recombination in R1 embryonic stem cells using standard procedures”. Adult 
mice, age 4-8 months, were used for behavioural analyses. For analyses of 
grooming behaviours*’, habituated, individually housed animals were video- 
taped for 24h under 700 1x (day) and ~2 Ix (red light at night) illumination. 
Tests for anxiety-like behaviours were performed as described*!’. All experi- 
ments were done blind to genotypes. Because of the presence of obvious facial 
lesions in Sapap3-mutant mice, it is impossible to perform video decoding in a 
blind manner. To avoid bias, trained individuals unfamiliar with the project 
(listed in the Acknowledgements) were recruited to decode the videos. 
Cortico-striatal electrophysiology. Acute sagittal brain slices (300 um) from 
P17—P25 mice were used for all experiments. The field recording electrode was 
placed in the dorso-lateral striatum and a monopolar stimulation electrode was 
placed in the corpus callosum. All recordings were performed at 30-32 °C and in 
the presence of picrotoxin and all data were collected and analysed before 
unblinding of genotypes. 

Stereotaxic injection. One-week-old Sapap3 ‘~ mice were anaesthetized and 
placed in a mouse head holder. GFP lentivirus or GFP-Sapap3 lentivirus was 
bilaterally injected into the striatum through 2 sites at 8 locations per hemi- 
sphere. Behavioural analyses were performed 4—6 months after injection. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Mice. A targeting vector was designed to replace exon 3 (containing the trans- 
lation initiation codon) of the Sapap3 gene with a NEO cassette. Genotypes were 
determined by PCR of mouse tail DNA, using primer Fl (ATTGGTAGG- 
CAATACCAACAGG) and RI (GCAAAGGCTCTTCATATTGTTGG) for the 
wild-type allele (147 base pairs), and Fl and R2 (CTITTGTGGTTCTAA- 
GTACTGTGG; in neo cassette) for the mutant allele (222 base pairs). Primer 
locations are indicated in Supplementary Fig. 1. 

Grooming behaviour’’. Because behaviours were most representative at the times 
02:00-06:00, 10:00—14:00 and 18:00-22:00, these segments were selected for ana- 
lyses using Noldus Observer software (Leesburg). The total amount of time in each 
4-h segment spent grooming, eating, sleeping, rearing, digging or pushing bedding 
(shifting) was determined, along with general states of activity. Grooming 
included all sequences of face-wiping, scratching/rubbing of head and ears, and 
full-body grooming. Bouts of behaviour lasted at least 3 s; pauses longer than 3 s 
constituted a new bout. Observer reliability was determined with Cohen’s kappa’. 
Anxiety-like behaviours’'*. Zero maze: an elevated zero maze"! was indirectly 
illuminated at 60 lx. Testing commenced with an animal being introduced into a 
closed area of the maze. Behaviour was video-taped for 5 min and subsequently 
scored by trained observers using the Noldus Observer (Nodus Information 
Technology). Anxiety-like behaviour was deduced on the basis of the percentage 
of time spent in the open areas and the latency to enter the open areas. Total 
activity time in the maze was an indicator of motor activity. Observer reliability 
was determined with Cohen’s Kappa. 

Open field: spontaneous locomotor activity was evaluated over 30 min in an 
automated Omnitech Digiscan apparatus (AccuScan Instruments) as described“. 
Locomotor activity was assessed as total distance travelled (cm). Anxiety-like 
behaviour was defined by the percentage of time spent in the centre as contrasted 
with the percentage of time spent in the perimeter (thigmotaxis) of the open field. 

Dark-light emergence test*’: Mice were adapted in an adjacent room to low 
light conditions (~40 Ix) and the test room was initially under similar illumi- 
nation. Testing was conducted in a two-chambered test apparatus (Med- 
Associates), with one side draped in black cloth (for example, dark-chamber) 
and the other illuminated at ~1,4001x (for example, light-chamber) with a 
170mA high intensity house light and overhead fluorescent lamps. 
Immediately on placing the mice into the darkened chamber, the lighted cham- 
ber was illuminated and the door between the two chambers was opened. The 
mice were allowed to freely explore the apparatus for 5 min. The latency to 
emerge from the darkened into the lighted chamber and the percentage of time 
spent in the illuminated chamber were used as indices of anxiety-like behaviours. 
Total motor activity in the two chambers was measured by infrared beam-breaks. 
Fluoxetine treatment. Adult mice 4-6 months of age were injected with fluox- 
etine (5 mg kg, intraperitoneally) once a day for one or six days and behaviours 
were tested 24h after the final injection. Grooming and anxiety-like behaviours 
were tested as described above. 

Skin analyses. Haematoxylin and eosin staining was performed on facial skin 
below the eyes by following a standard protocol. For visualization of sensory nerves 
in skin, Sapap3-mutant mice were crossed with Thy1GFP-J mice**. Mice were 
perfused transcardially with saline and 4% paraformaldehyde, and skin from just 
below the eyes was dissected and post-fixed for 24h. Confocal images were taken 
from 50-,1m cryosections. Skin lesions were scored as follows: 0, normal skin; 1, 
hairless patches; 2, small open-wound lesions, length <2 mm; 3, moderate open- 
wound lesions, length 2-4 mm; 4, large open-wound lesions, length >4 mm. 
Cortico-striatal electrophysiology. Recording perfusion solution contained (in 
mM): 119 NaCl, 2.5 KCl, 1.2 NaH,PO,, 26 NaHCOs, 1 MgCh, 2 CaCl, and 0.1 
picrotoxin (GABA, receptor antagonist). Slicing and recovery solutions were 
identical to perfusion solution except for containing 0.5 CaCl, and no picro- 
toxin. APV (50 1M) and NBQX (50 uM) were obtained from Tocris. Solutions 
were continuously equilibrated with 95% O, and 5% CO (pH7.4) and per- 
fusion flow rate was 2 ml min '. Slices were allowed to recover for a minimum of 
1h at 30-32 °C, following slicing. All recordings were performed at 30-32 °C. 

The field recording electrode was placed in the dorso-lateral striatum and a 
monopolar stimulation electrode was placed in the corpus callosum. Current 
was delivered to the stimulating electrode using an AMPI. Stimulus Isolator 
(AMPI) for 150s. Three distinct components were resolved in the majority 
of recordings: stimulation artefact, negative peak 1 (NPI, action-potential- 
derived on the basis of latency, resistance to NBQX and picrotoxin, and sens- 
itivity to tetrodotoxin) and negative peak 2 (NP2, fEPSP based on latency and 
sensitivity to NBQX; in addition, sensitivity to tetrodotoxin indicates the res- 
ponse is not due to direct activation by stimulating electrode current). 
The callosal stimulation site was chosen over an intra-striatal site to minimize 
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activation of non-cortical axons. Data were acquired at 20 kHz and filtered at 
2kHz using MultiClamp 700B amplifier and pClamp 10.0 software (Axon 
Instruments). Data were analysed offline using Clampfit 10.0 (Axon 
Instruments). Five consecutive responses were averaged before measuring 
amplitude, slope or area in the respective assays. When amplitudes are reported, 
similar conclusions were obtained by slope analysis. Paired-pulse responses were 
evoked using a stimulation intensity that yielded the maximal fEPSP response. 
Slope values used for paired-pulse ratios refer to slope during the period from 
20-80% of the peak response. NMDAR fEPSPs were evoked using the same 
stimulation intensity that yielded the maximal fEPSP response under the basal 
recording conditions used to generate the input-output curves. The area of 
NMDAR field potential responses was measured during a standard 20-ms time 
window beginning approximately 8 ms after stimulation. All data were collected 
and analysed before unblinding of genotypes. In addition, NMDAR field poten- 
tial recordings and correlative PSD biochemical studies were performed in two 
different laboratories and were unblinded at the same time. 

For in vivo viral expression rescue experiments, the same slicing, recording, and 
analysis methods were used as described above except that the stimulating and 
recording electrodes were placed within the fluorescent green tissue. Only slices 
where green cells were visible in the dorsolateral recording region were used. Viral 
rescue experiments were performed and analysed blind to the viral identity. 
Diolistics. ‘Bullets’ were prepared by coating tungsten particles (Bio-rad) 
with Dil (Molecular Probes-Invitrogen) dissolved in methylene chloride as 
described*’. Mice were perfused transcardially, first with normal saline, and then 
with buffered 4% paraformaldehyde. Fixed mice were decapitated and the head 
was allowed to post-fix for 24-48 h in buffered 4% paraformaldehyde at 4°C. 
Brains were dissected and 250 um coronal slices were cut on a vibratome. 

Diolistic filling of individual cells was performed using a standard Bio-Rad 
biolistic system. Slices were placed into the interior chamber of a Falcon organ 
tissue culture dish (catalogue number 3,037) and excess buffer was removed. A 
Millipore 3.0-11m Isopore filter (catalogue number TSTP04704) was placed to rest 
on top of the tissue culture dish and a Kimwipe was placed on top of the Millipore 
filter. Bullets were shot through the Kimwipe and filter barrier using a helium charge 
of 552 kPa (80 psi). Individual labelling of 3-15 striatal medium spiny neurons was 
successful in approximately 50% of slices. Following the diolistic procedure, slices 
were allowed to incubate in buffer at 4°C for 1-2h. Following incubation, slices 
were immediately mounted and imaged on a confocal microscope. 

Lentivirus production and stereotaxic injection. The lentivirus vector was based 
on FUGW* and modified by replacing the ubiquitin promoter with a CMV 
promoter to improve neuronal expression. GFP was fused to the amino terminus 
of SAPAP3 to generate the GFP-SAPAP3 fusion protein. The complementary 
DNA clone encoding GFP or GFP-SAPAP3 was placed downstream of the 
CMV promoter between BamHI and EcoRI sites“*. Viral particles were produced 
by transient transfection of 293T cells with the lentiviral vector encoding GFP or 
GFP-SAPAP3 along with the envelope (VSVg) and packaging (A8.9) vectors. High 
titre viral suspensions were obtained by ultra-centrifugation at 77,000g for 90 min. 

One-week-old Sapap3”'~ mice were anaesthetized with a mixture of ketamine 
and xylazine (intraperitoneally), and placed in a mouse head holder (David Kopf 
Instruments). Lentivirus encoding GFP or GFP—SAPAP3 was bilaterally injected 
into the striatum through two sites at eight locations per hemisphere. At each of 
the injection sites, the microinjection needle was advanced to the deepest (vent- 
ral) position for the first injection; additional injections were made every 0.3 mm 
while withdrawing the injection needle. The coordinates from Bregma were: 
injection site 1, location 1-3: anterior 0.5 mm, mediolateral 1.4mm, dorsoven- 
tral 2.8, 2.5, 2.2 mm; injection site 2, location 4-8: anterior 0.2 mm, mediolateral 
1.7 mm, dorsoventral 3.3, 3.0, 2.7, 2.4, 2.1 mm. For each injection location, 90 nl 
of virus was injected using Nanoject II (Drummond Scientific), and the needle 
was left in place for 3 min after each injection. 

Electron microscopic analysis. Striatal tissues for electron microscopic analysis 
were processed and quantitatively analysed according to protocols previously 
described’. Synapses with clear membranes (likely to be cut orthogonal to the 
plane of the synaptic membrane) were photographed at X 15,000 magnification. 
Data collection and analysis were performed by observers blinded to genotypes. 
For every clearly defined synapse, data for PSD length and PSD thickness were 
collected. 

PSD preparation and western blot. PSD fractions of the striatum were prepared 
as described'”*’, separated by SDS-PAGE and probed with specific antibodies. 
The relative amount of B-tubulin and B-actin were used as loading controls for 
quantification. Antibodies for SAPAP3, PSD95, PSD93, Shank and NR2B have 
been described previously'”?*°. Antibodies for NR1 (mouse monoclonal) and 
NR2A (rabbit polyclonal) were from BD Biosciences and Upstate, respectively. 
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Gap junction adhesion is necessary for 
radial migration in the neocortex 


Laura A. B. Elias’”, Doris D. Wang’? & Arnold R. Kriegstein? 


Radial glia, the neuronal stem cells of the embryonic cerebral cortex, reside deep within the developing brain and extend 
radial fibres to the pial surface, along which embryonic neurons migrate to reach the cortical plate. Here we show that the gap 
junction subunits connexin 26 (Cx26) and connexin 43 (Cx43) are expressed at the contact points between radial fibres and 
migrating neurons, and acute downregulation of Cx26 or Cx43 impairs the migration of neurons to the cortical plate. 
Unexpectedly, gap junctions do not mediate neuronal migration by acting in the classical manner to provide an aqueous 
channel for cell-cell communication. Instead, gap junctions provide dynamic adhesive contacts that interact with the 
internal cytoskeleton to enable leading process stabilization along radial fibres as well as the subsequent translocation of the 
nucleus. These results indicate that gap junction adhesions are necessary for glial-guided neuronal migration, raising the 
possibility that the adhesive properties of gap junctions may have an important role in other physiological processes and 


diseases associated with gap junction function. 


During neocortical brain development, radial glia, which reside in 
the ventricular zone and extend a fibre to the pial surface, serve two 
purposes: (1) as stem cells of the developing cortex, giving rise to 
neurons’, and (2) as guides along which the neurons migrate to 
reach the correct lamina of the cortical plate, where they will become 
pyramidal cells in the adult cortex* >. Interstitial junctions containing 
filamentous material have been found between migrating neurons 
and radial fibres, suggesting a close association between the two cell 
types®*. 

The molecules that mediate the critical interaction between the 
radial glial fibre and the migrating neuron remain largely unknown. 
Astrotactin has been described as an adhesion molecule in the cere- 
bellar cortex”’®, but has not been implicated in neocortical develop- 
ment. Neuregulin and its soluble form, glial growth factor, mediate 
bi-directional signalling between the radial fibre and the migrating 
neuron, and are important for radial glial maintenance and neuronal 
migration’. «38, integrin expressed in migrating neurons is involved 
in glial recognition, and oy integrin expressed in radial glia is 
involved in neuronal adhesion’. The expression of gap junctions 
in radial fibres and migrating neurons, together with the detection 
of gap junctions between nestin™ radial glial fibres and nestin” cells 
by electron microscopy, suggests the possibility that gap junctions 
play a part in neuron-glial interactions and neuronal migration’. 
This hypothesis is supported by the change in the distribution of 
bromodeoxyuridine (BrdU) pulse-labelled cells in the Cx43 genetic 
knockout (Cx43KO)". Furthermore, strong evidence links Cx43 to 
migration in neural crest cells and gliomas; however, the mechanisms 
are not fully understood’**. 

Gap junctions are large-diameter channels made up of two hemi- 
channels—each composed of six connexin subunits—on opposing 
membranes that join through hydrophobic interactions and form an 
aqueous pore between the cytoplasm of two adjacent cells’’. The gap 
junction subunits expressed in the developing cortex include Cx26 
(also known as Gjb2), a B1 connexin family member, and Cx43 
(also known as Gjal), an %1 connexin family member”’. There is 
strong evidence that radial glia are coupled electrically and chem- 
ically to each other by gap junction channels during embryonic 


development”’. Functionally, it has been suggested that gap junctions 
have a role in radial glial cell cycle regulation because phar- 
macological block of gap junctions, or Ca** waves mediated by 
gap junction hemichannels, inhibits the cell cycle””’. However, the 
possible role of gap junctions in cortical migration remains largely 
unexplored. 


Gap junction expression at contact points 


The gap junction subunits Cx26 and Cx43 are expressed in the deve- 
loping rat neocortex. Whereas Cx26 is evenly distributed from the 
ventricular zone through the intermediate zone to the cortical plate, 
Cx43 is highly expressed at the ventricular surface and its levels are 
reduced in the cortical plate (Fig. la, b, e, f). Cx26 and Cx43 puncta 
do not colocalize with each other (Supplementary Fig. 1), because 
they belong to separate families that do not make heterotypic junc- 
tions or heteromeric hemichannels™*. The expression of both con- 
nexins outside of the ventricular zone proliferative region indicates 
that gap junctions may regulate functions additional to radial glial 
proliferation. To address this possibility, we determined the pattern 
of Cx26 and Cx43 expression with respect to B-III tubulin* migrat- 
ing neurons and vimentin™ radial glial fibres. Both gap junction 
subunits are expressed in migrating neurons and along radial fibres. 
In fact, Cx26 and Cx43 are highly localized in neurons to the regions 
of contact with radial fibres, consistent with the idea that gap junc- 
tions may have a role in mediating communication between migrat- 
ing neurons and their radial guides (Fig. 1, and Supplementary 
Movies 1, 2). 


Cx26 and Cx43 are necessary for migration 

To investigate the role of gap junctions in cortical development, we 
developed acute loss-of-function manipulations for each connexin 
using RNA interference. Short hairpin RNA (shRNA) constructs that 
produced significant knockdown of rat Cx26 or Cx43 by western blot 
in Cos-7 monkey cells were selected (Cx26-shRNA or Cx43-shRNA, 
respectively), and it was confirmed that the same hairpin sequences 
with three point mutations (Ctrl-shRNA) failed to produce signi- 
ficant knockdown (Supplementary Fig. 2a, b). We confirmed that 
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Cx26-shRNA and Cx43-shRNA were able to knockdown endogenous 
connexin protein in the developing rat brain (Supplementary Fig. 
2c, d). 

We next examined the functional effects of Cx26 or Cx43 down- 
regulation in the intact developing neocortex. In utero intraventri- 
cular injection and electroporation of Cx26-shRNA or Cx43-shRNA 
at embryonic day (E)16 resulted in a striking cellular redistribution 
pattern compared with Ctrl-shRNA at E18 and E21 (Fig. 2a). We 
quantified this effect by dividing the cortex into five equal areas and 
determining the fraction of the total number of shRNA-expressing 
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Figure 1 | Cx26 and Cx43 are localized to contact points between migrating 
neurons and radial glial fibres. a, e, Immunohistochemistry of cortical 
sections at E16 showing overlay of Cx26 (a) or Cx43 (e) (red), vimentin 
(green) to label radial fibres, and B-III tubulin (blue) to label immature 
neurons. b, f, Red channel of panels a and e showing expression pattern of 
Cx26 (b) or Cx43 (f). ¢, g, Migrating neuron from boxed area of panels a and 
e; clockwise: overlay, vimentin (green), Cx26 (¢) or Cx43 (g) (red), and B-III 
tubulin (blue). Cx26 and Cx43 are expressed in neurons and radial glia 
(white arrows). d, h, Overlay of Cx26 (d) or Cx43 (h) and f-III tubulin in 
migrating neuron (from panels ¢c and g). i, j, ‘High resolution’ confocal stacks 
highlighting the relationship between Cx26 (i) or Cx43 (j) puncta, vimentin 
fibres and a B-III tubulin*® neuron (dotted outline). Cross sections through 
the x and y axis, bottom and right panels, respectively, show individual 
connexin puncta (white arrows) at the interface between the vimentin ~* 
radial fibre and B-III tubulin® migrating neuron (Supplementary Movies 1 
and 2 provide three-dimensional rotations). Scale bars, 5 jm. CP, cortical 
plate; IZ, intermediate zone; VZ, ventricular zone. 
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(GEP*) cells in each area (Fig. 2b). At E18, there was a significant 
change in the fraction of cells in all areas except in the ventricular 
zone; most clearly there was a loss of cells in the lower and upper 
cortical plate (Fig. 2b). The effect was more dramatic at E21; the 
percentage of cells in each area, except the ventricular zone/SVZ 
(sub-ventricular zone), was significantly different. Most strikingly, 
there was an accumulation of cells in the intermediate zone, with far 
fewer cells in the cortical plate (Fig. 2b). The intermediate phenotype 
of the Cx26-shRNA is due to a rostral caudal gradient such that Cx26- 
shRNA exerts a stronger effect on neuronal migration in caudal 
regions where Cx26 is highly expressed (Supplementary Fig. 3). 
The target-specificity of these results was confirmed by using alterna- 
tive shRNA targets for Cx26 and Cx43 (Supplementary Fig. 4). 
Because radial glia express connexins, it was possible that the defect 
in neuronal migration was due to a disruption in the radial glial 
scaffolding. However, immunohistochemical analysis of the radial 
glia, as well as the migration of wild-type neurons in electroporated 
regions, seemed normal (Supplementary Fig. 5). To address this 
possibility further, transplant assays were performed. Brains were 
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Figure 2 | Knockdown of Cx26 or Cx43 impairs neuronal migration. 

a, Coronal sections from brains electroporated at E16. Cx26-shRNA or 
Cx43-shRNA reduces the fraction of cells in the cortical plate (white arrows) 
at E18 and E21. b, Quantification of shRNA knockdown-mediated cell 
distribution effect. Left panels, representative optical section with grid 
overlay of five equal areas (A—E) and quantification threshold. Right panels 
show a significant change in the distribution of cells in all areas except A at 
E18 and E21 (1-way ANOVA, E18: A, P = 0.0745; B, P = 0.0010; C, 
P=0.0106; D, P= 0.0001; E, P< 0.0001; E21: A, P = 0.3625; B, P= 0.0001; 
C, P = 0.0003; D, P< 0.0001; E, P= 0.0001; n = 3 brains per condition). 
Values represent mean + s.e.m. c, Connexin expression in neurons is 
necessary for migration. Ctrl-shRNA-expressing cells transplanted into a 
wild-type host brain are able to integrate and migrate, whereas transplanted 
Cx26-shRNA- or Cx43-shRNA-expressing cells have reduced migration 
capacity (Chi-square, Ctrl-shRNA versus connexin-shRNA: A, P < 0.0001; 
B, P< 0.0001; C, P> 0.05; D, P< 0.0001; E, P< 0.0001; Ctrl-shRNA, 

n = 225 cells; Cx26-shRNA, n = 421; Cx43-shRNA n = 698). The pial 
surface (pia) and ventricle (vent.) are labelled to indicate orientation. 
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electroporated at E16 and shRNA-expressing (GFP*) cells were 
microdissected and intraventricularly injected into wild-type brains 
at E17. In contrast to donor Ctrl-shRNA expressing cells, which were 
able to engraft and migrate towards the upper layers of the cortex, the 
Cx26-shRNA- and Cx43-shRNA-expressing donor cells were able to 
engraft into the host brain but had a significantly reduced migration 
capacity (Fig. 2c). These data suggest that connexin expression in 
neurons is required for migration. 

We next examined secondary defects potentially able to affect 
cell distribution independent of migration. Neurons expressing Cx- 
shRNA were able to exit the cell cycle and begin to differentiate 
normally (Supplementary Fig. 6a, b). Furthermore, terminal trans- 
ferase dUTP nick end labelling (TUNEL) staining showed no change 
in apoptosis (Supplementary Fig. 6c). Additionally, the expression 
pattern of N-cadherin, zona occludens-1 and {-integrin seemed 
unchanged, suggesting that connexin knockdown does not overtly 
disrupt expression of other cell-cell adhesion proteins (Supplemen- 
tary Figs 7, 8). Together these data suggest that the defect in migra- 
tion is not secondary to a defect in cell cycle exit, differentiation, cell 
death, or downregulation of other adhesion proteins. 


Adhesions, not channels, mediate migration 
What is the mechanism by which gap junctions mediate neuronal 
migration? We sought to differentiate among three plausible func- 
tions. Gap junctions can electrically and chemically couple cells to 
allow the exchange of current or small molecules, mediate extra- 
cellular release of substrates such as ATP through hemichannels, or 
provide an adhesive contact between two cells. We performed select- 
ive rescue experiments to distinguish these possibilities. 

First, to rescue the migration phenotype, conservative mutations 
(CM) were introduced into the shRNA-targeted regions of Cx26 
or Cx43 (Cx26CM or Cx43CM respectively), which prevented 
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shRNA-induced knockdown (Supplementary Fig. 9). These con- 
structs, or enhanced yellow fluorescent protein (EYFP) alone, were 
electroporated with the shRNA constructs at a set molar ratio. The 
co-expression of Cx43CM or Cx26CM with its respective shRNA 
construct significantly rescued the fraction of cells that migrate to 
the cortical plate, demonstrating the target-specificity of the shRNA 
effect (Fig. 3). 

To determine whether functional channels are necessary to rescue 
the migration defect, a conserved tyrosine in the third transmem- 
brane domain of Cx26 and Cx43 was mutated (Cx26CMT135A or 
Cx43CMT154A, respectively). These dominant-negative connexin 
mutants are able to make adhesions but have a closed channel and 
thus are not able to mediate exchange between cells or with the extra- 
cellular environment” (Supplementary Figs 10, 11). Remarkably, 
these mutants rescued the migration of cells to the cortical plate 
(Fig. 3). The ability of the closed channel mutants to rescue the 
migration defect strongly suggests that the channel (either hemichan- 
nel or gap junction channel) does not contribute to the role of gap 
junctions in neuronal migration. 

It has been shown that Ca’~ is important for neuronal migration”® 
and that gap junctions mediate Ca" waves in the developing cortex 
by releasing ATP through hemichannels that activates purinergic 
P2Y1 receptors on radial glia**. Thus, to further test the possibility 
that hemichannel-mediated Ca** waves have a role in neuronal 
migration, Ca”* waves were inhibited by pharmacologically ant- 
agonizing the P2Y1 receptor with suramin or by knocking down 
the P2Y1 receptor with multiple shRNA constructs. No migration 
defect was observed, suggesting that hemichannel-mediated Ca** 
waves are not involved in cortical neuronal migration (Supplemen- 
tary Fig. 12). 

To determine whether the adhesive properties of gap junctions are 
necessary for migration, one of the conserved extracellular cysteines 


Cx43-shRNA + *** 
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Figure 3 | Rescue of connexin-shRNA induced migration phenotype by 
wild-type connexin and connexin mutants that make adhesions but not 
channels. a, Cx43CM (function: channel, adhesion, C terminus), 
Cx43CMT154<A (function: adhesion, C terminus), or Cx43CMT154A 1-238 
(function: adhesion) significantly increased the fraction of cells in the 
cortical plate compared to EYFP when co-expressed with Cx43-shRNA 
CP < 0.0001, t-test; EYFP, n = 6 brains; Cx43CM, n = 4; Cx43CMT154A, 
n= 5; Cx43CMT154A 1-238, n = 6). Cx43CMC61S (function: 
hemichannel, C terminus) did not rescue the migration defect (EYFP versus 
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Cx43CMC61S, t-test P = 0.4727; Cx43C61S, n = 4 brains). b, Cx26CM 
(function: channel, adhesion, C terminus), Cx26CMT135A (function: 
adhesion, C terminus), or Cx26CMT135A 1-216 (function: adhesion) 
significantly increased the fraction of cells in the cortical plate compared to 
EYFP when co-expressed with Cx26-shRNA (***P = 0.0001, t-test; EYFP, 
n = 4 brains; Cx26CM, n = 5; Cx26CMT135A, n = 5; CxX26CMT135A, 
1-216, n = 5). For 26-shRNA rescue, only caudal (caud.) sections (as 
described in Methods) were used, because the migration phenotype is 
stronger caudally (Supplementary Fig. 3). Values represent means + s.e.m. 
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was mutated (Cx43CMC61S) to produce a Cx43 channel that does 
not make adhesions and thus cannot form gap junctions’’, but main- 
tains the ability to make functional hemichannels (Supplementary 
Figs 10, 11). Interestingly, Cx43CMC61S failed to rescue the Cx43- 
shRNA-induced migration defect, suggesting that the adhesive prop- 
erties of gap junctions are necessary for radial migration (Fig. 3a). 

In addition, we wanted to determine whether signalling by car- 
boxy-terminal interactions with cytosolic proteins are important. 
Cx26 has a short, uncharacterized intracellular C terminus, whereas 
Cx43 has a longer C terminus, which interacts with a variety of pro- 
teins including zona occludens-1, V-Src, and tubulin’”*’. Both C- 
terminal truncations of the closed channel mutants (Cx43CMT154A 
1-238 and Cx26CMT135A 1-216) were able to rescue the migration 
defect (Fig. 3). These results suggest that signalling mediated by the 
cytoplasmic C termini does not have a significant role during migra- 
tion. However, the possibility remains that intermolecular inter- 
actions with other parts of the connexin protein may be important. 
In summary, we found that adhesion, but not the channel or the 
C terminus, is necessary for the role of gap junctions during neuronal 
migration. 


Gap junctions provide adhesion 


To demonstrate that gap junctions can mediate adhesion in cortical 
cells, we built on previous work showing that overexpression of Cx43, 
but not Cx43C61S, increases adhesion in C6 glioma cells'’—a cell type 
without endogenous gap junction expression*’. We used C6 cells with 
or without stable transfection of Cx43—EYFP*' to provide a glial sub- 
strate on which we could place embryonic cortical cells and test their 
ability to adhere. Cortical cells were electroporated with Ctrl-shRNA, 
dissected and FACS-sorted for GFP, labelled with DiD, and plated on 
C6 cells or C6 cells expressing Cx43—EYFP (C6 + Cx43). We found a 
twofold increase in the number of DiD* cells adhering to the 
C6 + Cx43 substrate compared with the C6 substrate, suggesting that 
cortical cells can use endogenous Cx43 as a means of cell-cell adhesion 
(Fig. 4a). 

To test whether Cx43-shRNA reduces the ability of cortical cells to 
adhere to the C6 + Cx43 substrate, Ctrl-shRNA or Cx43-shRNA 
FACS-sorted cells were plated on C6 cells or C6 + Cx43 cells. 
There was a significant reduction in the ratio of the number of 
Cx43-shRNA to Ctrl-shRNA cells on the C6 + Cx43 substrate com- 
pared with the C6 substrate (Fig. 4b). Thus, we show that cortical cells 
can use endogenous Cx43 to adhere to a glial substrate and that 
knocking down Cx43 reduces their ability to adhere to a Cx43- 
expressing substrate, but not a control substrate. 

Classical adhesion molecules typically provide a link with internal 
cytoskeletal components. Previous work in neural crest cells demon- 
strates that Cx43 colocalizes and co-immunoprecipitates with 
numerous actin-binding proteins, and that filamentous actin is dis- 
organized in neural crest cells from the Cx43 knockout mouse”. To 
explore connexin—actin interactions, actin tagged with the red fluor- 
escent protein cherry (actin—cherry) was co-electroporated with 
GFP. Electroporated actin is diffusely distributed in radial glia and 
newborn neurons and becomes punctate in multipolar SVZ cells and 
migrating neurons (Supplementary Fig. 13). Co-electroporation of 
Cx26T135A or Cx43T154A (to avoid toxicity associated with over- 
expression of wild-type connexin; Supplementary Fig. 11) with 
actin—cherry revealed that both Cx26 and Cx43 puncta frequently 
colocalize with the actin puncta, with the connexin puncta slightly 
offset (Fig. 4c). Interestingly, cells expressing Cx26-shRNA or Cx43- 
shRNA, have significantly fewer actin puncta than Ctrl-shRNA- 
expressing cells (Fig. 4d). This suggests that both Cx26 and Cx43 
can interact with the actin cytoskeleton. 


Connexin dynamics during migration 


To investigate the dynamic behaviour of migrating neurons, we per- 
formed time-lapse live imaging. We observed that control migrating 
neurons typically start with a bifurcated leading process, one branch 
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of which prevails, and the neuron’s soma then translocates forward 
into a dilation that forms in this leading process* (Fig. 5a, Supple- 
mentary Movie 3). Interestingly, migrating neurons with multiple 
leading processes can show differential expression of Cx43 or Cx26 
in these processes (Fig. 5b). Hence, we hypothesized that gap junc- 
tion adhesions play a part in stabilizing the leading process along a 
radial glial fibre. Indeed, Cx43-shRNA- and Cx26-shRNA-expressing 
neurons are unable to stabilize their processes and continue to extend 
multiple branches (Fig. 5c, Supplementary Movie 4, 5). This pheno- 
type is reminiscent of neural crest cells from Cx43KO mice that show 
increased protrusive activity but decreased directional migration”. 

To analyse the dynamics of connexin puncta localization during 
neuronal migration, the Cx43T154A—EYFP or the Cx26T135A— 
EYFP plasmid was co-electroporated with a plasmid expressing the 
red fluorescent protein tomato to visualize the cells, and followed by 
time-lapse microscopy. The closed pore connexin mutants were 
expressed in radial glia and migrating neurons in a pattern that 
resembled wild-type expression, but avoided the toxicity associated 
with wild-type connexin overexpression (Supplementary Fig. 11). 
Furthermore, because these mutants were sufficient to rescue migra- 
tion, we can assume that they behave similarly to wild-type connexins 
with respect to neuronal migration. 
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Figure 4 | Gap junctions promote cortical cell adhesion and interact with 
the internal actin cytoskeleton. a, Control DiD-labelled (red), FACS-sorted 
cortical cells were allowed to adhere to a glial substrate of confluent C6 or 
C6 + Cx43 cells. Right panel, quantification of the number of adherent 
DiD* cells per image frame (**P = 0.001, t-test, n = 3 experiments). b, DiD- 
labelled, FACS-sorted Ctrl-shRNA or Cx43-shRNA expressing cells were 
plated on a C6 or C6 + Cx43 glial substrate. The ratio of Cx43-shRNA to 
Ctrl-shRNA cells that adhered to the C6 + Cx43 substrate was about 20% 
less than to the C6 substrate (**P < 0.001, t-test; C6, n = 3 experiments; 
C6 + Cx43, n = 4 experiments). Values represent mean + s.e.m. 

c, Cx26T135A—EYFP (top) or Cx43T154A—EYFP (bottom) puncta 
colocalize with actin—cherry (red) (cell bodies outlined in white), with their 
centre of fluorescence 0.5—1 |tm apart in the z-plane (coloured arrows). 
Graph shows a frequency histogram of the distance between the centre of the 
puncta. d, Fewer actin puncta are observed in cells expressing Cx26-shRNA 
or Cx43-shRNA than those expressing Ctrl-shRNA (**P < 0.001, t-test; Ctrl- 
shRNA, n = 37 cells; Cx26-shRNA, n = 21; Cx43-shRNA, n = 44). Images 
taken at ‘high resolution’. Values represent mean + s.e.m. Scale bars, 5 Lum. 
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The localization of Cx43T154A puncta in the branches of bifurcated 
leading processes was predictive of the dominant branch that would 
be maintained over time, and the absence of puncta was predictive of 
the transient branch (Fig. 5d, e, and Supplementary Movie 6). In the 
rare case in which a radial fibre was visible, stabilization of the branch 
along the fibre was observed (Fig. 5d, cell 2). When Cx26T135A was 
observed in the branches of bifurcated neurons, it also localized to the 
dominant branch (Supplementary Fig. 14a); however, Cx26T135A 
was not frequently present in the branches. 

Cx26T135A was often localized to puncta in the cell body. 
Interestingly, Cx26T135A puncta in the cell body move into the dila- 
tion before the movement of the nucleus in approximately 81% of 
translocation events (n = 56) (Fig. 5f, Supplementary Movie 7 and 
Supplementary Fig. 14b). Cx43T154A puncta found in the cell body 
followed a similar pattern in approximately 57% of translocation events 
(n= 24) (Supplementary Fig. 14c). This stereotyped pattern of con- 
nexin puncta movement, which mimics the pattern of centrosome 
movement, indicates that gap junctions may provide the hypothesized 
adhesion contact associated with the centrosome in the dilation of 
translocating neurons**™. This hypothesis is supported by the obser- 
vation that Cx26T 135A and Cx43T 154A puncta colocalize with centrin 
II, where the connexin puncta are slightly offset towards the cell 
membrane (Fig. 5g, and Supplementary Fig. 14d). Other patterns of 
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Figure 5 | Gap junction adhesions have a role in the stabilization of the 
migrating neuron’s leading process and in the translocation of the soma. 
a, Time-lapse of two control migrating neurons (white arrowheads), 
demonstrating the sequence through which bifurcated neurons (t = 0-1 h) 
select a dominant leading process (t = 1-2 h) and translocate their soma. 

b, Immunohistochemistry for Cx43 (top) or Cx26 (bottom) in a B-III 
tubulin” bifurcated neuron or neuronal process (relative connexin 
expression, white arrows > yellow arrow). Scale bars, 5 jm. ¢, Time-lapses of 
neurons expressing Cx43-shRNA (left) or Cx26-shRNA (right) (see 
Supplementary Movies 4 and 5 for complete time-lapse). Neurons fail to 
stabilize a dominant process (note multiple transient processes), and the 
Cx26-shRNA neuron fails to translocate its soma. d, Time-lapses of 
Cx43T154A puncta in branches of bifurcated neurons. The dominant process 
(white arrowheads) contains numerous connexin puncta (circled in white 
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connexin dynamics in the trailing process and between radial fibres 
and neurons were also observed (Supplementary Fig. 14e, f). 

Overall, the dynamics of connexin puncta localization and of 
shRNA electroporated cells suggest that both Cx43 and Cx26 have 
important roles in leading-process stabilization and somatic trans- 
location, with Cx26 playing a more prominent part at the soma and 
Cx43 in the branches (Fig. 5h). Such a functional separation is con- 
sistent with the observation that Cx26 and Cx43 do not make hetero- 
meric gap junctions, and localize to separate domains in gap junction 
plaques* (Supplementary Fig. 1). 


Discussion 


This study indicates that gap junctions mediate glial-guided radial 
migration of newborn neurons in the cortex. A recent study using 
conditional genetic deletion of Cx43 indicates that gap junctions 
may, in addition, mediate radial migration in the hippocampus 
and cerebellum*’. We find that gap junctions can promote adhesion 
in cortical cells, and that their adhesive but not their channel prop- 
erties are necessary for migration. These dynamic gap junction adhe- 
sions between migrating neurons and radial glial fibres regulate 
properties of neuronal migration, including branch stabilization 
and nuclear translocation. The migration defect described here with 
acute knockdown of Cx43 is more dramatic than the phenotype 
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when multiple branches), whereas transient processes (blue arrowheads) do 
not. e, The number of puncta in the dominant (Dom.) and transient (Trans.) 
processes (***P = 0.0001, paired t-test, n = 13). Values represent 

mean = s.e.m. f, Time-lapse of a Cx26T135A punctum in the cell body of a 
migrating neuron. The punctum (white arrowhead) moves into the dilation 
in the leading process before the translocation of the cell body (blue 
arrowhead). g, Cx26T135A and centrin II—dsRed colocalize (white arrows) in 
the dilation of a migrating neuron (left panels), with their centres of 
fluorescence approximately 1 jum apart in the z-plane (middle panels, 
coloured arrows). Frequency histogram of the distance between the centres of 
the puncta (right panel). Scale bar, 1 um. h, Model of gap junction adhesion 
dynamics in migrating neurons. Gap junction adhesive contacts stabilize the 
dominant leading process along radial fibres and provide an anchor in the 
dilation of the leading process before translocation of the soma/nucleus. 
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described in the Cx43KO mouse™. This may be due to developmental 
compensation in the genetic knockout arising from redundancy 
among connexins. Indeed, we show at least one other connexin, 
Cx26, has a very similar role in neuronal migration. Because only 
Cx43 has been previously implicated in cell migration, the role of 
Cx26 is interesting and implies that other connexins, including B1 
family connexins, may be involved in migration. In addition, our 
work indicates that different connexins may have distinct functional 
roles in neural migration, with Cx26 playing a dominant role in 
nuclear translocation and Cx43 in branch stabilization. Further- 
more, because Cx43 associates with other adhesion molecules includ- 
ing N-cadherin and integrin*”**”,, it will be of interest to explore their 
signalling relationships. For instance, the low expression of Cx43 in 
the cortical plate indicates a possible interaction with integrins— 
which play a role in the cessation of migration***°—and may aid in 
the detachment of neurons from radial fibres. 

To our knowledge, a functional role for gap junctions based on 
adhesion, rather than channel activity, has not been described to date. 
Might gap junction adhesions have a role in other gap-junction- 
mediated cellular processes? It is thought that gap junction channels 
as well as their interactions with molecules such as p120catenin, 
integrin and the actin cytoskeleton are important for neural crest cell 
migration***, and that glioblastoma invasion of the brain par- 
enchyma requires functional gap junctions between tumour cells 
and astrocytes’’. The migration of lung and skin cancer cells has also 
been associated with gap junction expression although no mech- 
anism has been proposed**. In addition, the neurological disorder 
Charcot-Marie-Tooth syndrome, caused by mutations in Cx32, may 
not only result from impaired gap junction communication but 
also from the loss of gap junction adhesions stabilizing Schmidt— 
Lanterman incisures. Further studies may elucidate an important 
role for the adhesive properties of gap junctions in the migration 
of neural crest cells and cancer cells, as well as in other diseases and 
physiological processes associated with gap junction function. 


METHODS SUMMARY 


Immunohistochemistry was performed following transcardial purfusion and 
cryostat sectioning. Light fixing techniques and citrate antigen retrieval were 
used for all connexin stains. Standard confocal images were acquired on an 
Olympus Fluoview 300 with 1m steps, and ‘high resolution’ images were 
acquired on an Olympus Fluoview 1000 with 0.5 tm steps. shRNA hairpins were 
inserted into the pLLox3.7 construct for expression*’. Plasmids were introduced 
into the in-vivo developing cortex by intraventricular injection of 1 pl of plasmid 
followed by electroporation’’. The concentration of the shRNA plasmids was 
1.7 ug ul ', and the concentration of the rescue constructs was varied according 
to the specified molar ratio. Intraventricular injections were carried out in E16 
timed-pregnant Sprague Dawley rats. For cell transplants, electroporations were 
performed at E16, GFP” cells were dissected from electroporated corticies at E17 
and dissociated, 50,000 cells per brain were intraventricularlly injected into wild- 
type E17 embryos, and embryos were fixed at E21. Rat Cx26 and Cx43 in the 
pEYFPNI1 vector (Clonetech) were altered with QuickChange site-directed 
mutagenesis (Stratagene) to make the various connexin rescue constructs. For 
adhesion assays, electroporations were performed at E16 and cells were dissected 
at E19, FACS-sorted for GFP, labelled with Vybrant DiD (Molecular Probes), 
and plated on 100% confluent C6 cells (containing the TVA receptor) or 
C6 + Cx43 cells (containing the TVA receptor and Cx43—-EYFP)*’, incubated 
for 30 min on a rotational shaker, and fixed. For actin or centrin co-expression 
experiments brains were electroporated at E16 and fixed at E19. For time-lapse 
imaging, electroporations were performed at E16, organotypic slice cultures 
were prepared’ at E18 and allowed to equilibrate overnight before images were 
collected using an Olympus Fluoview 300. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Immunohistochemistry and imaging. Embryos were fixed by transcardial pur- 
fusion with PBS followed by 4% paraformaldehyde in PBS (PFA). A subsequent 
perfusion of PBS was performed in embryos used for Cx26/43 staining. Brains 
were post-fixed over night (except for Cx26/43 staining), dehydrated in 20% and 
30% sucrose, frozen, and cryostat sectioned at 14 tum (except for Cx26/43 staining, 
which used 10 um). For endogenous knockdown assays, dissociated cortical cells 
were plated on Poly-t-Lysine coated coverslips, allowed to adhere for 2h, and 
fixed in PFA (10 min). Citrate antigen retrieval was preformed for Cx26/43 and 
Ki67 staining. Blocking solution contained 2% gelatin, 10% serum and 1% Triton- 
X-100 in PBS. Sections/cells were incubated overnight in primary antibodies: 
rabbit anti-Cx43 (1:50, Zymed), mouse anti-Cx43 (1:50, Zymed), rabbit anti- 
Cx26 (1:25, Zymed SI-2800), chicken anti-vimentin (1:1,000, Chemicon), mouse 
anti-B-III tubulin (TUJ1) (1:200, Covance), chicken anti-GFP (1:500, Aves Labs), 
rabbit anti-Ki67 (1:1,000, Novocastra), mouse anti-bromodeoxyuridine Alexa 
Flour 647 (1:50, Molecular Probes), mouse anti-nestin (1:100, Chemicon), rabbit 
anti ZO-1 (1:100, Zymed), mouse anti N-cadherin (1:100, Zymed), and mouse 
anti B1-integrin (1:100, BD Transduction Laboratories). TUNEL staining was 
performed using TMR red Cell Death Detection Kit (Roche). Secondary antibod- 
ies included Alexa 488/568 conjugates (Molecular Probes) or Cy3/Cy5 conjugates 
(Jackson). Standard images were acquired on an Olympus Fluoview 300 laser- 
scanning confocal microscope. ‘Standard’ images of connexin staining represent 
collapsed = 3 tm stacks collected at 1-1m steps, and other standard images rep- 
resent single optical sections. ‘High resolution’ images were collected on an 
Olympus Fluoview 1000 at 0.5-p1m steps. Images were analysed using 
Photoshop 7.0. 

Plasmid constructs. ShRNA oligonucleotides were inserted into the dual pro- 
moter construct pLLox3.7 for expression under the U6 promoter (EGFP was 
expressed under the CMV promoter)**. The following target sequences were 
used for Cx26, TCTGGAATTTGCATCCTGCTA (alternative target: GCAGC- 
GTCTGGTGAAGTGTAA); and Cx43, GCAATTACAACAAGCAAGCTA (alter- 
native target: GGCTTGCTGAGAACCTACATCATCA). The control shRNA 
oliognucleotiedes with three point mutations (underlined) were as follows for 
Cx26, TCTTGAATATGCATCGTGCTA; and Cx43, GTAATTGCAACAAGAA- 
AGCTA. The following target sequences were used for the P2Y1 receptor: P2Y1- 
shRNA(2) GGAGTGAGGCCAATTTACA, P2Y1-shRNA(3) GAGTACCTGCG- 
AAGTTATT. 

Rat Cx26 and Cx43 clones in the pEYFP-N1 vector (Clonetech) were a gift of 
D. Laird. QuickChange site-directed mutagenesis (Stratagene) was used to intro- 
duce the following point mutations as underlined: Cx26CM, CTCTGTGTCCG- 
GAATCTGCATACTGCTCAACATCACAGAGCTGTG; Cx43CM, CCTCGTG- 
CCGGAATTATAACAAACAAGCCAGCGAGCAAAACTGGG; Cx43C61S, CT- 
CAACAACCTGGCTCCGAAAACGTCTGCTATGAC; Cx26T135A, TCCCTG- 
TGGTGGGCCTACACCACCAGC; Cx43T54A, GGCTTGCTGAGAGCCTAC- 
ATCATCAGCATCC; Cx26 1-216, ATCACAGAGCTGTGCTATCTG TTCATT- 
AGAATTCGCTCAGGGAAGTCC; Cx43 1-238, TTCAAAGGCGTTAAGGA- 
TCCCGTGAAGGGAAG (introduced restriction site in bold). The P2Y1 recep- 
tor was cloned with the following primers: AGCTCGAGTGCCTGAGTTGG- 
AAAGAAG (sense), TCGAATTCGGGCGTAGTCGGGCACGTCGTAGGGG- 
TACAAACTTGTGTCTCCGTT (antisense), and inserted into the pEYFPN1 
vector (Clonetech) using the Xhol and EcoR1 restritction sites. The antisense 
primer introduced a haemagglutinin (HA) tag and removed the stop codon. 
Cell transfections and western blots. Western blots were performed on Cos-7 
cells transfected with FuGENE6 transfection reagent (Roche) with 1 jug of DNA, 
200 pl Opti MEM and 5 pl FuGENE6 per 35 mm dish. Samples were collected 3 
days post transfection in RIPA buffer (150 mM NaCl, 0.1%SDS, 1% triton X100, 
50mM TrisHCl pH 6.8, 1mM EDTA, 0.5% deoxycholate, 10% glycerol) plus 
protease inhibitors, normalized for protein concentration, denatured at 37 °C 
for 30 min in 4X loading buffer (6 mM Tris pH 6.8, 5% B-Mercaptoethanol, 2% 
SDS, .05% bromophenol blue), and run on Criterion Precast 12.5% Tris-HCl 
gels (Bio-Rad). Membranes were probed with rabbit anti-Cx43 (1:1,000, Zymed) 
or rabbit anti-Cx26 (1:2,000, Zymed 71-0500) followed by anti-rabbit peroxi- 
dase conjugate (1:2,000, Sigma) or with mouse anti-HA peroxidase for P2Y1- 
HA detection (1:2,000, Sigma H 6533). All blots were stripped and re-probed 
with mouse anti-y-tubulin (1:500, Sigma) and anti-mouse peroxidase conjugate 
(1:400, Sigma). For quantification, all western blots were normalized to the levels 
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of the loading control y-tubulin and band intensity was measured in Photoshop. 
C6 cells were transfected with FuGENE6 transfection reagent (Roche) with 
0.3 ug of DNA, 20 pl Opti MEM and 2 pl FuGENE6 per 16 mm dish. 

In-utero injection and electroporation/cell transplants. Animals were main- 
tained according to protocols approved by the Institutional Animal Care and 
Use Committee at UCSF. Plasmids were introduced into the developing cortex in 
vivo by intraventricular injection and electroporation”’. Intraventricular injec- 
tions were carried out in E16 timed-pregnant Sprague Dawley rats as previously 
described’. Electroporations were performed using an Electro Square Porator 
ECM830 (Genetronics) (5 pulses, 50 V, 100 ms, 1-s interval). DNA was prepared 
in endotoxin-free conditions and 1 tl was injected per brain at the following 
concentrations: shRNA constructs, 1.7 Lg ult; rescue constructs, at a molar 
ratio of 0.67:1 (Cx26 rescue construct: Cx26-shRNA) or 0.45:1 (Cx43 rescue 
construct: Cx43-shRNA); Cx26T135A and Cx43T154A 1.7 ug ul! ; Tomato 
lpgul”'; actin-cherry 1.7ugpl-'; centrin I-dsRed 2.0ugul'; HcRed 
2.5 ug pl’. For analysis of endogenous knockdown, electroporations were per- 
formed at E16 and cortices were dissociated at E18, allowed to adhere in culture 
for 2 h, fixed and stained. For cell transplants, electroporations were performed 
at E16, GFP* cells were dissected from electroporated corticies at E17 in 
ACSF(125mM NaCl, 2.5mM KCl, 1mM MgChL, 2mM CaCl, 1.25mM 
NaH,PO,, 25mM NaHCO; and 20mM glucose), the cells were dissociated 
and resuspended in culture media containing 25% Hanks BBS, 47% Basal 
Medium Eagle (Gibco), 5% Fetal bovine serum, 1X Pen/Strep, 0.66% glucose, 
1% N2-supplement (Invitirogen). Fifty-thousand cells per brain were intraven- 
tricularly injected into wild-type E17 embryos, and embryos were fixed at E21. 
The video protocol in ref. 44 fully explains the in utero intraventricular injections 
and electroporations. 

Quantification of cell distribution. The shRNA cell distribution phenotype was 
quantified by taking a total of 20 evenly spaced rostral to caudal sections per 
brain from 3 brains per condition. A threshold was applied in Photoshop and the 
fraction of cells in each of 5 rows of equal area (A—E) stacked parallel to the 
ventricular zone surface was quantified in each slice and then averaged across 
slices. The Cx43-shRNA rescue experiments were quantified in a similar manner 
by counting the fraction of cells in the cortical plate (layers C, D and E) in 10 
rostral to caudal sections per brain for 4—6 brains per condition. For the Cx26- 
shRNA rescue experiments, 5 medial to caudal sections per brain for 4—6 brains 
per condition were quantified. For transplant experiments, confocal stacks of all 
sections with cells that integrated into the cortex were collected and the number 
of cells in each area (A—E) was counted. All quantifications were performed in a 
blind manner. 

Cell sorting. Electroporations were performed at E16 and cells were dissected at 
E19 in ACSF. Cells were transferred to PBS 2%FBS immediately before sorting 
and kept on ice. Cells were sorted for GFP on a FACSAria (Becton Dickinson) 
using a 100 tm nosel, 488 laser with the 530/30 filter, and were collected in 
culture media. Collected cells were labelled with Vybrant DiD cell labelling 
solution (Molecular Probes) in serum-free culture media for 20 min and washed 
according to the instructed protocol. Ten-thousand cells were plated per well of 
100% confluent C6 cells grown on 24-well collagen biocoat plates (Becton 
Dickinson). C6 (containing the TVA receptor) and C6 + Cx43 (containing the 
TVA receptor and Cx43—EYFP) cells were a gift of A. Lai*'. Cells were incubated 
for 30 min, with shaking at 250 rpm, and then washed 4 times with PBS, and 
fixed. Each experiment included three or more wells of each condition. Five 
pictures of each well were taken (middle, right, left, top, bottom), and quanti- 
fications were carried out in a blind manner. 

Time-lapse imaging. Electroporations were performed at E16 and 300-pm thick 
vibratome sections were prepared at E18 in ACSF. Slices were plated on Millicell- 
CM inserts (Millipore) in culture medium and allowed to equilibrate overnight 
before imaging. Confocal images, were collected on the Olympus Fluoview 300 at 
2-5-|1m steps (to minimize bleaching) every hour for 8-10 h. Stacks containing 
the cell of interest were collapsed and analysed in Photoshop. The video protocol 
in ref. 45 fully explains the slice culture system used for time-lapse experiments. 


44. Walantus, W., Elias, L. & Kriegstein, A. R. /n utero intraventricular injection and 
electroporation of E16 rat embryos. J. Visualized Exp. 6 (http://www.jove.com/ 
index/Details.stp?1ID=236&VID=222) (2007). 

45. Elias, L. & Kriegstein, A. R. Organotypic slice culture of E18 rat brains. J. Visualized 
Exp. 6 (http://www.jove.com/index/ Details.stp?I|ID=235&VID=221) (2007). 
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No extreme bipolar glaciation during the main Eocene 


calcite compensation shift 


Kirsty M. Edgar’, Paul A. Wilson’, Philip F. Sexton't & Yusuke Suganuma? 


Major ice sheets were permanently established on Antarctica 
approximately 34 million years ago'’, close to the Eocene/ 
Oligocene boundary, at the same time as a permanent deepening 
of the calcite compensation depth in the world’s oceans*. Until 
recently, it was thought that Northern Hemisphere glaciation began 
much later, between 11 and 5 million years ago'*°. This view has 
been challenged, however, by records of ice rafting at high northern 
latitudes during the Eocene epoch®’ and by estimates of global ice 
volume that exceed the storage capacity of Antarctica*® at the same 
time as a temporary deepening of the calcite compensation depth 
~41.6 million years ago’. Here we test the hypothesis that large ice 
sheets were present in both hemispheres ~41.6 million years ago 
using marine sediment records of oxygen and carbon isotope values 
and of calcium carbonate content from the equatorial Atlantic 
Ocean. These records allow, at most, an ice budget that can easily 
be accommodated on Antarctica, indicating that large ice sheets 
were not present in the Northern Hemisphere. The records also 
reveal a brief interval shortly before the temporary deepening of 
the calcite compensation depth during which the calcite compensa- 
tion depth shoaled, ocean temperatures increased and carbon iso- 
tope values decreased in the equatorial Atlantic. The nature of these 
changes around 41.6 million years ago implies common links, in 
terms of carbon cycling, with events at the Eocene/Oligocene 
boundary‘ and with the ‘hyperthermals’ of the Early Eocene climate 
optimum*"*"’, Our findings help to resolve the apparent discrep- 
ancy between the geological records of Northern Hemisphere 
glaciation®* and model results’”'* that indicate that the threshold 
for continental glaciation was crossed earlier in the Southern 
Hemisphere than in the Northern Hemisphere. 

A striking feature of the composite’ Cenozoic 5'*O record in benthic 
foraminiferal calcite is the abrupt increase in values across the Eocene/ 
Oligocene boundary, 34 Myr ago (Fig. la). Together with evidence 
from Southern Ocean records for the initiation of ice rafting’* and 
glacial weathering, and from sequence stratigraphic studies’* for a glo- 
bal sea level fall (~70 m apparent sea level), this 5'°O shift is widely 
interpreted to signify the onset of major Antarctic glaciation’*'* 
(Fig. 1a). The standard view'*? is that substantial Northern Hemi- 
sphere glaciation occurred much later (Fig. 1a); recently, however, 
three lines of evidence have challenged this orthodoxy. 

First, evidence now exists for ice rafting in the Arctic Ocean around 
45 Myr ago® and in the Norwegian—Greenland Sea between 37 and 
27 Myr ago’ (Fig. 1a). Second, new records for the Eocene/Oligocene 
boundary from the equatorial Pacific Ocean (ODP Leg 199 of the 
Ocean Drilling Program) show‘ that the amplitude of the 5'°O change 
(A3!"Openthic = 1.5% VPDB) is impossibly large to be attributed to 
Antarctic glaciation alone. The simplest explanation for this result is 
that some of the Ad'’Oxenthic signal must denote global cooling’. 
However, Mg/Ca records”'® provide little support for declining 


temperatures. This raises the possibility of ice growth beyond Antar- 
ctica and/or the operation of some factor (such as changes in seawater 
carbonate chemistry associated with increasing calcite compensation 
depth, CCD) acting to mask the cooling signal in the Mg/Ca records*"®. 
Third, the 5!80 increase associated with the Eocene/ Oligocene bound- 
ary occurred in lock-step with permanent deepening of the CCD, 
possibly in response to glacioeustatic sea level fall and reduced shelf 
carbonate accumulation*.Thus, the discovery”” of multiple intervals 
of temporary CCD deepening in the equatorial Pacific earlier in the 
Eocene (Fig. 1b) has prompted speculation of transient glaciations 
before the Eocene/Oligocene boundary. A recent study* invokes three 
such glaciations including one extreme bipolar event (100-190m 
apparent sea level change; ~35-70 X 10°km? ice) coincident with 
the most prominent of the transient Eocene CCD deepening events 
(carbonate accumulation event CAE-3 ~41.6 Myr ago; ref. 9 and 
Fig. 1b, c). Yet the idea that large ice sheets existed in both hemispheres 
at this time is at odds with contemporaneous warm polar ocean tem- 
peratures”*'*, high atmospheric CO) levels’? (Fig. 1a) and the occur- 
rence of (sub)tropical flora at mid- to high latitudes”. Sequence 
stratigraphic records*' support the concept of early Cenozoic glacia- 
tions, but the ice sheets invoked are much more modest in size (~10 to 
45 m apparent sea level change; ~ 40 to 65% of present East Antarctic 
ice sheet) and considered compatible with high-latitude warmth 
because of their restriction to the interior of Antarctica”'. 

Although the sensitivity to CCD change” of the Pacific drill sites 
used to infer extreme bipolar Eocene glaciation is beneficial to under- 
standing carbon cycling, it is detrimental to generating isotope 
records from calcareous microfossils whose occurrence and preser- 
vation are sensitive to changes in carbonate saturation state. Thus, 
further tests of bipolar Eocene glaciation are warranted. We gener- 
ated high-resolution (~4.5 kyr) monospecific stable isotope records 
for the interval spanning CAE-3 (ref. 9) from the equatorial Atlantic 
Ocean (ODP Leg 207, Demerara Rise, Site 1260, 9° 16’ N, 54° 33’ W; 
palaeowater depth 2,500-3,000 m)”. These sediments were depos- 
ited well above the local CCD for most of the study interval and are 
shallowly buried”, favouring calcareous microfossil preservation”. 

In Fig. 1c—e, we compare our new 8'°O records from the equatorial 
Atlantic with the record presented® from the equatorial Pacific in 
support of bipolar glaciation. 5'°O data from the equatorial Pacific 
show a considerable spread, and the amplitude of increase used to 
infer extreme bipolar glaciation relies on outliers in sparse data 
(A8'Openthic = 1.2%0 VPDB, arrows, Fig. 1c). Our data from the 
Atlantic are more continuous and show less spread (Fig. 1d, e). 
Our benthic record (Fig. 1d) shows a well-defined minimum around 
the middle of magnetochron C19r followed by a shift to higher values 
attained by the onset of CAE-3 (zone I/II boundary, Fig. 1c). Our 
planktic record shows a similar pattern (Fig. le). One interpretation 
of the correspondence between our planktic and benthic records is 
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that it reflects an increase in the oxygen isotope composition of sea 
water (5'°O,-awater) and supports the notion of ice growth. Another 
interpretation, arguably more in keeping with the other palaeocli- 
mate records’? and numerical modelling results'”’* shown in Fig. 1a, 
is that our data are attributable to cooling of surface and bottom 
waters at Site 1260. Regardless, our estimate of the A5 8 Openthic signal 
associated with CAE-3 is about half that proposed* from the Pacific 
(Fig. 1d versus Fig. 1c). 

In Fig. 2 we show the relationship between measured AS 8 Openthic 
and global continental ice volumes for a range of values for mean 
3'80 of stored ice (8'%O,..). Horizontal dashed lines denote different 
values for AdS'’Openthic Vertical solid lines correspond to different 
values assumed for 5'°0O;,. and intersections yield the resulting esti- 
mated ice volumes (see ‘isovol’ lines). Lines a and b assume that the 
whole Ad’ Openthic signal is attributable to ice volume. Assuming that 
5'8Oice was as extreme as today (about —50%0 SMOW*™”), the 
Ad!"Openthic Signal inferred in the Pacific study* for CAE-3 (line a 
in Fig. 2) yields a global middle Eocene ice budget almost 1.5 times 
the present Antarctic ice volume (25.4 X 10° km?)**, However, the 
latitudinal temperature gradient during the Eocene was less extreme 
than today, so it is likely that 3'80,.. was also less extreme (conserva- 
tively, about —30%0 SMOW*”*”?), yielding even larger ice budgets 
(almost 2.5 times the modern Antarctic budget). This ice volume is 
close to the global total estimated for the last glacial maximum 
(Fig. 2), when large ice sheets existed not only on Antarctica and 
Greenland but also over large areas of North America and Eurasia, 
and it implies an even greater global ice budget for the Eocene/ 
Oligocene boundary when benthic 5'%O values were ~0.8%o higher 
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(Fig. 1a). Even larger ice volumes were invoked*® for CAE-3 in the 
Pacific study using the dual benthic Mg/Ca—6&'*O method 
(A8'"O.cawater = 1.5%o). But we focus on the Ad’Openthic signal alone 
because the presence of authigenic dolomite’®'’ towards the base of the 
section at ODP Site 1218 means that the only potentially meaningful 
Mg/Ca data published for CAE-3 are for its termination (see figure 3 of 
ref. 8). By focusing on the AS!’Opentnic Signal alone we adopt a conser- 
vative approach because the ice budgets calculated in this way in ref. 8, 
although extreme (because the whole signal was attributed to ice 
growth with no associated cooling), are less extreme than those that 
were estimated by the dual benthic Mg/Ca — 5'°O method. 

If we assume no cooling component associated with the smaller 
Ad'8Openthic signal seen in our new record (line b, Fig. 2), our revised 
estimate of the upper limit of possible ice growth associated with 
CAE-3 is correspondingly smaller, it need not require ice storage in 
the Northern Hemisphere and it is more in keeping with sequence 
stratigraphy records*'. On this basis, we cannot exclude the possible 
existence of small valley glaciers draining the uplands of Greenland 
around 41.6 Myr ago, particularly in light of dropstones recently 
discovered®’ in Eocene strata in the high northern latitudes. 
Nevertheless, we can rule out the existence of large ice sheets in the 
Northern Hemisphere and we therefore find no support for extreme 
Eocene bipolar glaciation. Based on work elsewhere**’® we might 
expect the AS'’Openthic Signal to be composed of at least equal parts 
ice volume and temperature (line b’, Fig. 2). On this basis we cal- 
culate ice budgets that are easily accommodated on central Antarctica 
alone (~0.4 to 0.6 times modern Antarctic budget). Under these 
circumstances, only by assuming an extremely high value for 
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Figure 1| Palaeoceanographic records showing changes in ocean 
chemistry and global climate. a, Cenozoic atmospheric CO) (ref. 19), grey 
shading defines the range of estimated CO> values as calculated in ref. 19 
relative to the pre-anthropogenic level (PAL). Vertical dashed blue lines are 
model-defined thresholds for large ice sheet initiation. SHG and NHG, 
Southern and Northern Hemisphere glaciation’”"’. Composite benthic 5'°O 
record’, Vertical blue bars show the standard view of the presence and extent 
of full scale/permanent ice sheets (solid bars) and those thought to have been 
partial/ephemeral (broken bars). Occurrence of newly documented ice 
rafted debris (IRD) from the high northern latitudes is also shown’. EECO, 
Early Eocene Climatic Optimum. b, CaCO; mass accumulation rate for the 
equatorial Pacific’”’. The inset shows the classic CCD record for the 
equatorial Pacific 50 Myr ago to the present, with the pink-shaded zone 
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indicating the interval of transient CCD events’. Epochs are indicated 
(abbreviated) to the left of a and in the inset. ¢, Benthic 5!°O record from 
ODP Site 1218 (the dark green data points are from ref. 8 and yellow ones are 
from ref. 16). The shaded grey zones are: I, pre-onset of CAE-3; II, onset-to- 
peak CaCO; mass accumulation rate during CAE-3; III, post-peak CaCO3 
mass accumulation rate during CAE-3. Red arrows denote data points 
defining’ a 1.2% amplitude of 5'8O increase (A5'8O) and thus the 
hypothesized extreme bipolar glaciation. d, e, New benthic and planktic 
8!°O records from ODP Site 1260. Median values (white lines) and 1¢ 
variance (shaded envelopes) are shown in ¢, d and e. Vertical dashed red lines 
in d denote our smaller estimate of AS'8O (~0.6%o VPDB), derived from the 
maximum and minimum values in the 1 envelope of the benthic record. 
Magnetochrons are indicated to the left of c. 
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5'°Oic¢ akin to the average for high altitude glaciers in the temperate 
zone today (=—15%o0 SMOW) does the calculated ice volume exceed 
the modern budget for Antarctica (Fig. 2). 

The main reason why the ice budgets of ref. 8 are so large is because 
their estimate of A3!’Openthic Was based on outlying data points and 
their isotope record is particularly sparse towards the top of magne- 
tochron C19r (note the coring gap on the run up to CAE-3 and 
inferred glaciation, at the zone I/II boundary in Fig. 1c). To assess 
this interval in more detail we show an expanded version of our 
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Figure 2 | Measured increases in 5'°0 in benthic foraminifera (A5"°Openthic) 
and resulting global continental ice volumes for a range of values for the 
mean 5"20 of stored ice (5'°0;,.). Horizontal dashed lines indicate different 
values for A5'8Openthic (described in the text) and assume A5'’Openthic = 100% 
ice volume (a, equatorial Pacific®; b, equatorial Atlantic, this study). Thick 
vertical lines correspond to different values of 5'80;,. (estimated 5'°0;.. today 
at —50%o and for the Middle Eocene at —30%o). Intersections yield resulting 
calculated ice volume shown by ‘isovols’ (values shown are X 10°km*). The 
horizontal dashed line labelled b’ corresponds to Ad'’Opentnic = 0.6%o, with 
50% of that signal attributable to ice. Present Antarctic ice volume (PAIV)”, 
Antarctic ice volume at the Last Glacial Maximum (ALGM)”* and total global 
ice volume at the Last Glacial Maximum (GLGM) shown for reference. 
SMOW, standard mean ocean water. 


foraminiferal 5'%O series together with accompanying 81°C, litho- 
logical and bulk sediment records in Fig. 3. Excursions to lower values 
of 5'°O (by ~0.4%o) and 8'°C (by 1.2-1.5%o) occur in both benthic 
and planktic records. These are accompanied by prominent decreases 
in bulk 5'8O (by 1.8%) and 8'°C (by 1.1%o) within magnetochron 
C19r, across a 30 cm interval containing a highly dissolved foraminif- 
eral assemblage and a well-developed red clay horizon where carbon- 
ate content falls sharply from ~75 weight percent CaCO; to a 
minimum of ~35 weight percent CaCO; (Fig. 3f). This event goes 
undetected in the equatorial Pacific record, presumably because it falls 
in the core gap at ODP Site 1218 (Fig. 1c) and therefore cannot explain 
the discrepancy between the two Ad" Openthic signals shown in Fig. 1. 
On the other hand, our records show that, while ODP Site 1260 
remained above the CCD for the vast majority of the study interval, 
shortly before the CCD deepened in the equatorial Pacific (CAE-3 in 
Fig. 1), it shoaled in the equatorial Atlantic sufficiently to be readily 
detectable at a water depth of about 3.0 to 2.5 km. In this respect, our 
records resemble those for the Eocene/Oligocene boundary where the 
CCD shoals sufficiently to completely eliminate CaCO; burial at ODP 
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Figure 3 | Palaeoceanographic records from ODP Site 1260 showing the 
run-up to CAE-3. We note the short-lived interval of CCD shoaling and 

ocean warming around 75 metres composite depth. a, Benthic 5'°C data. 
b, Planktic 5!°C data. ¢, Benthic 8'%O data. d, Planktic 5!8O data. e, Bulk 
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sediment lightness”. f, Bulk sediment weight %CaCO3. g, Bulk sediment 
38°C. h, Bulk sediment 5!80. Question marks in a—d denote absence of two 
to three samples because of the lack of suitable foraminifera for analysis in 
this interval. Grey shaded zones I and II as in Fig. 1. 
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Site 1218 for ~200 kyr immediately before the rapid two-step deepen- 
ing associated with Antarctic glaciation (see figure 1 of ref. 4). 

We cannot yet determine whether the C19r event is global in 
occurrence because of the coring gap at ODP Site 1218 and radio- 
larian clays and low sedimentation rates (<0.2 cm kyr” ') at other key 
drill sites in the Equatorial Pacific’ and on Shatsky Rise’’. But it is 
distinct from the much longer-lived Middle Eocene Climate 
Optimum already documented at Site 1260’* and it is more similar 
in expression to the well-known intervals of rapid carbon-cycle-led 
global warming and CCD shoaling in the early Palaeogene. The mag- 
nitude of the palaeoceanographic signals that we have documented 
for magnetochron C19r at Site 1260 is smaller than seen at the 
Palaeocene/Eocene boundary” but at least as large as those marking 
the Eocene Thermal Maximum 2 (~53 Myr ago)". 

Our findings appear to highlight the instability of Eocene climate 
and to emphasize close coupling to the carbon cycle as expressed in 
oceanic carbonate saturation and 81°C. If the interval of CCD shoal- 
ing and isotopic shifts that we document is substantiated elsewhere, 
our results suggest that carbon-cycle-led instabilities akin to those 
documented in the early Eocene were not restricted to the Cenozoic 
climate optimum but also occurred millions of years later when the 
Earth’s baseline climate had cooled substantially and was poised 
much closer to the threshold of continental glaciation (Fig. la). 
Climate modelling experiments suggest'”’* that this threshold would 
have been crossed earlier in the Southern Hemisphere than in the 
Northern Hemisphere (Fig. la) because of the fundamentally differ- 
ent land—ocean distributions at the two poles. Our work helps to re- 
align the geologic record with this view. 


METHODS 

Chronology. We use the published** palaeomagnetic reversal stratigraphy for 
ODP Site 1260, which we supplemented by analysing nearly 100 additional 
samples collected at approximately 20 to 30cm intervals across each of the 
magnetic reversals in the studied stratigraphic interval (magnetochrons C20r 
through C18r). Methods are as in ref. 28. 

CaCO; and stable isotope data. All data were generated at the National 
Oceanography Centre, Southampton, UK. Bulk sediment weight %CaCO3 was 
measured on small (20-30 mg) discrete samples (1-5 cm spacing), by high- 
precision coulometry. Our foraminiferal stable isotope (5'°O and 5'°C) data were 
generated using species separates of the benthic and planktic foraminifera 
Cibicidoides eoceanus and Morozovella lehneri respectively. Foraminifera were 
cleaned by ultrasonication before analysis. Benthics were picked from the size range 
300-450 ttm while planktics were picked from the 250-300 um size fraction. Bulk 
stable isotope measurements were generated on splits of the same samples used for 
CaCO; data. All stable isotope measurements were determined using a Europa 
GEO 20-20 mass spectrometer equipped with an automatic carbonate preparation 
system. Results are reported relative to the Vienna Pee Dee Belemnite (VPDB) 
standard with an external analytical precision, based on replicate analysis of an in- 
house standard calibrated to NBS-19, better than 0.1%o for 8!8O and 61°C. Benthic 
5'8O values that we report from ODP Site 1260 have been adjusted to equilibrium 
by adding 0.28%o VPDB, the Palaeogene correction factor for Cibicidoides. We plot 
Cibicidoides data from ODP Site 1218 in the same way to aid comparison (Site 1218 
data from Nuttallides truempyi are reported in the same way as in ref. 8). 
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Northern Hemisphere forcing of climatic cycles in 
Antarctica over the past 360,000 years 


Kenji Kawamura’’+, Frédéric Parrenin®, Lorraine Lisiecki*, Ryu Uemura’, Francoise Vimeux®”, 
Jeffrey P. Severinghaus”, Manuel A. Hutterli®, Takakiyo Nakazawa’, Shuji Aoki’, Jean Jouzel’, Maureen E. Raymo’, 
Koji Matsumoto'+, Hisakazu Nakata'}, Hideaki Motoyama’, Shuji Fujita’, Kumiko Goto-Azuma”, Yoshiyuki Fujii> 


& Okitsugu Watanabe” 


The Milankovitch theory of climate change proposes that glacial— 
interglacial cycles are driven by changes in summer insolation at 
high northern latitudes’. The timing of climate change in the 
Southern Hemisphere at glacial—interglacial transitions (which 
are known as terminations) relative to variations in summer 
insolation in the Northern Hemisphere is an important test of this 
hypothesis. So far, it has only been possible to apply this test to the 
most recent termination”’, because the dating uncertainty assoc- 
iated with older terminations is too large to allow phase relation- 
ships to be determined. Here we present a new chronology of 
Antarctic climate change over the past 360,000 years that is based 
on the ratio of oxygen to nitrogen molecules in air trapped in the 
Dome Fuji and Vostok ice cores*’. This ratio is a proxy for local 
summer insolation’, and thus allows the chronology to be con- 
structed by orbital tuning without the need to assume a lag 
between a climate record and an orbital parameter. The accuracy 
of the chronology allows us to examine the phase relationships 
between climate records from the ice cores*’ and changes in 
insolation. Our results indicate that orbital-scale Antarctic climate 
change lags Northern Hemisphere insolation by a few millennia, 
and that the increases in Antarctic temperature and atmospheric 
carbon dioxide concentration during the last four terminations 
occurred within the rising phase of Northern Hemisphere summer 
insolation. These results support the Milankovitch theory that 
Northern Hemisphere summer insolation triggered the last four 
deglaciations*'°"’. 

Antarctic deep ice cores have provided a wealth of information on 
glacial—interglacial climate change and atmospheric composition®”’. 
However, uncertainties of current timescales are large. For the Dome 
Fuji’, Vostok’? and Dome C" cores, timescales based on inverse 
methods using ice flow models disagree by up to ~5,000 years 
(~5kyr), because of uncertainties in modelled accumulation rate 
and/or ice thinning’. Another approach is orbital tuning, in which 
a global atmospheric signal (8'80 of Os, or concentration of CH,) is 
matched to presumed orbital forcing with an assumed lag'*'”. This 
approach is generally affected by the difference between the ice age 
and the gas age, which adds error of the order of 1 kyr to the ice 
chronology”, in addition to uncertainty in the assumed lag of several 
kyr (see below). Here we eliminate most of these uncertainties with a 
new orbital tuning that uses the O,/N, ratio of trapped air, which is a 
proxy for the local summer insolation at the deposition sites’. 


Because O, molecules are preferentially excluded from freshly 
formed air bubbles during the snow-ice transition’’, the O2/N; ratio 
of air trapped in Antarctic ice is depleted by 5—10%o on average from 
the atmospheric ratio. The O./N, in the Vostok® and Dome Fuji* cores 
(Fig. 1) varies in accordance with local summer insolation with an 
amplitude of ~ 10%o, with stronger insolation leading to more O2/N> 
depletion. Local summer insolation presumably controls the physical 
properties of near-surface snow, which in turn determines the mag- 
nitude of O2/N; fractionation during the bubble close-off’. The O2/N> 
records of Antarctic inland cores are expected to be in phase with 
summer solstice insolation. Support for this view comes from the 
present-day observation at Dome Fuji of maxima within several days 
of the solstice in snow temperature (10-cm depth) and vertical tem- 
perature gradient (between 10 and 50cm) (Supplementary Fig. 1), 
both of which are important factors for snow metamorphism’. 
Positive feedback between snow grain size and absorption of solar 
radiation, as well as a negative relationship between surface albedo 
and solar zenith angle, increases the importance of the solstice’. 

Because of the physical link, the O,/N, orbital tuning is distinct 
from traditional orbital tuning for ice and marine cores using global 
signals. The Dome Fuji and Vostok’ O,/N, records show no detect- 
able climatic imprint (Supplementary Information), a feature that 
makes this proxy an ideal tuning tool. In particular, the O2/N> 
records lack a nonlinear 100-kyr response, which has to be ignored 
or modelled during traditional orbital tuning. Our timescale is thus 
ideal for examining terminations. Another advantage is that our 
method gives the age of ice despite its use of the gas record, because 
the O./N> fractionation during bubble formation is determined by 
the physical properties of surrounding ice. 

We derive chronological tie points between the O2/N>2 record 
and local summer solstice insolation by peak-to-peak matching 
(Methods), to modify the original glaciological timescale? (DFGT- 
2003). We can use O2/N> data only from the depths in which air is 
occluded as clathrate hydrates*, which restricts the youngest tie point 
to ~82 kyr ago (all dates given are relative to AD 2000). The DFGT- 
2003 timescale, which has a constraint at ~41 kyr ago, is unchanged 
for 0-41.5 kyr ago and its agreement with Greenland timescales is 
probably within 1 kyr, based on a comparison with the Byrd-core 
isotope record on the GISP2 timescale’*. We linearly interpolated 
the DFGT-2003 timescale between all the O,/N, tie points, as well 
as between 41.5 and 82 kyr ago. 
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France. “Department of Earth Sciences, Boston University, 685 Commonwealth Avenue, Boston, Massachusetts 02215, USA. °National Institute of Polar Research, Research 
Organization of Information and Systems, 1-9-10 Kaga, Itabashi-ku, Tokyo 173-8515, Japan. “Institut de Recherche pour le Développement (IRD), UR Great Ice. ’IPSL/LSCE, Laboratoire 
des Sciences du Climat et de l'Environnement, UMR CEA-CNRS-UVSQ, CE Saclay, Orme des Merisiers, 91191 Gif-sur-Yvette, France. ®British Antarctic Survey, High Cross, Madingley 
Road, Cambridge CB3 OET, UK. {Present addresses: National Institute of Polar Research, Research Organization of Information and Systems, 1-9-10 Kaga, Itabashi-ku, Tokyo 173-8515, 
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We applied our method also to the Vostok core using published 
data”? (Fig. 1). The new Vostok timescale (VkO-2006) resulted in an 
agreement with the Dome Fuji O2/N> timescale to within 1 kyr for 
most of the overlapping period (~200-340 kyr ago), indicating that 
our method is robust. A large discrepancy was found only where the 
Dome Fuji O2/N> data are too noisy to reliably constrain the time- 
scale (220-230 kyr ago, Fig. 1b and c). To improve and extend the 
Dome Fuji O2/N> timescale, we adopted tie points at 221.2, 230.8 and 
334.9 kyr ago (Fig. 1c) from the Vostok core, by transferring them to 
the Dome Fuji core by visual matching of the isotopic records”"” (the 
final Dome Fuji timescale is named DFO-2006). We note that an 
independent study", limited to Vostok, used new’ and published” 
O,/N> data to derive a similar timescale, and discussed palaeoclimate 
implications. 

The dating uncertainties at the tie points (Fig. 1c) due to the noise 
in the O2/N> data (for example, from non-orbital natural variation, 
experimental error and gas loss correction) are estimated with a 
Monte Carlo method to range from 0.8 to 2.9 kyr (Methods). The 
uncertainty between the tie points originating in the DFGT-2003 
timescale and its linear interpolation is estimated using an autocor- 
related random walk model (Fig. le; Supplementary Information). 
The accuracy of the DFO-2006 timescale is validated through 
comparison with four radiometric-dated time markers spanning 
92-131 kyr ago, all of which agree within the uncertainty (Sup- 
plementary Information). 

This accurate chronology permits a critical examination of the 
phase relationship between Antarctic climate and orbital variations 
(Fig. 2). We examine the 8'°O;,.. record of the Dome Fuji core and a 
preliminary reconstruction of ‘site temperature’ (AT\-) based on an 
isotopic inversion, which accounts for the influence of seawater 5'°O 
(hereafter 5'80,,,) and vapour source temperature” (Methods). Our 
chronology also indirectly gives the timing of the CO; rise at termi- 
nations, which occurs within ~1 kyr of the increase in Antarctic 
temperature”. 

A striking feature of Dome Fuji 5'°Oj.¢ and ATyite records is that 
they appear to reflect every precessional-band variation of the north- 
ern insolation curve (Fig. 2), even for Marine Isotope Stages (MIS) 6c 
and 8c where changes are small, as well as the weak variation in MIS 3. 


LETTERS 


The result confirms earlier observations®” of the similarity of the 
patterns of northern insolation and the Antarctic climate, and further 
provides the phasing between them. The 5'°O;-¢ power spectrum 
(Fig. 3) shows strong power in the precession band (23-kyr period) 
with a mean lag of 1.0 + 0.5 kyr behind precession. The site temper- 
ature AT,;¢ shows smaller obliquity and precession components than 
5'°Ojce, and the precession component is as strong as the obliquity 
component (Fig. 3). The lag of AT,;,. behind precession is 1.8 + 0.5 kyr. 
At 41-kyr period, 5'%O,.. and AT, lag behind obliquity by 2.1 + 0.7 
and 4.7 + 1.1 kyr, respectively. Importantly, the southern summer 
insolation (Fig. 2) is in anti-phase (or is completely out of phase) 
with Antarctic climate, so does not appear to have a strong influence 
on the orbital-scale Antarctic climatic changes. These phasings 
suggest that the Antarctic climate on orbital timescales is paced by 
northern summer insolation presumably through northern ice sheet 
variation, although minor influence by southern insolation cannot 
be entirely ruled out. Our data therefore disfavour hypotheses that 
call for early Southern Hemisphere warming by southern summer 
insolation to trigger the northern deglaciation”’. 

Another remarkable feature of the data is that the last four 
Antarctic terminations occur within the rising phase of northern 
summer insolation. We examine the onset of rapid warming towards 
interglacial warmth, excluding prior gradual warming for termina- 
tions II and III (Fig. 4). Because the gradual warming is attributable 
to a linear climatic response to northern summer insolation for all 
climatic substages, they should be separated from terminations when 
examining hypotheses'”'*”?*. The onsets are distinctly defined in the 
Antarctic climate records for terminations I, II and IV (Fig. 4). We 
identify the onset of termination III as the start of the continuous 
warming at 247 kyr ago after a brief temperature reversal, but our 
conclusion is unchanged if the prior small peak is included. These 
onsets occur 6, 3, 7 and 2 kyr later than the 65° N summer solstice 
insolation minima, respectively, for terminations I, II, II] and IV 
(Supplementary Table 1), and the entire duration of these warmings 
also fit within the rising phase of northern summer insolation (Fig. 4). 
The timing of the last four terminations is thus fully consistent with 
Milankovitch theory*”®. One possible explanation for the difference 
in the timings of terminations relative to insolation is the hypothesis 
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Figure 1| Orbital tuning of the Dome Fuji and Vostok timescales using 
O./N; records. a, Dome Fuji 5'°0;,, (red, ref. 7) and Vostok temperature” 
(blue, converted from 5D) on the respective O2/N> timescales (DFO-2006 
and VkO-2006). b, O2/N, records of the Dome Fuji’ (filled red circles) and 
Vostok? (filled blue circles) cores and filtered curves (solid lines) on the 
O,/N> timescales, and filtered curves on the original glaciological timescales 
(dashed lines). ¢, Age tie points for the Dome Fuji (red) and Vostok (blue) 


cores with 2c error bars. The tie points at 221.2, 230.8 and 334.9 kyr ago for 
DFO-2006 are adopted from the Vostok data (see text). Two discarded 
points of the Dome Fuji core are shown (grey markers). d, Summer solstice 
insolation at 77° S as the tuning target (inverted axis scale). e, 20 dating 
uncertainty of DFO-2006 (red) and VkO-2006 (blue, for oldest part) (see 
Supplementary Information). 
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Figure 2 | Comparison of Dome Fuji climatic and atmospheric records with 
insolation and sea level records. a, Dome Fuji 8'80ice (black, ref. 7) and 
temperature deviation (AT,;,.) from the mean of the last 10 kyr (red, this 
study) on the DFO-2006 timescale, and summer solstice insolation at 65° N 
(orange) and 65° S (light blue dashed line). The insolation curves are scaled 
to match 65° N insolation with low-pass filtered 8!80 (not shown) for the 
MIS 5b-a transition. Corresponding MIS numbers” are shown without 
implying their exact timings. The AT,;,. and 5180,.. records are scaled to 
visually match the changes from the LGM to early Holocene. b, Atmospheric 
CO, concentration from the Dome Fuji core by wet extraction® (black 
circles) and dry extraction (pink triangles, this study). (Two wet-extraction 
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Figure 3 | Power spectra of 5'°O and AT. ite records calculated by the 
Blackman-Tukey method with 50% lags. The results (obtained using 
AnalySeries software) are normalized so that the integrated power over 
periods longer than 10 kyr becomes unity. Periods of three spectral peaks are 
shown. 
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CO, outliers® (at 149.8 and 365.0 m, in Holocene ice) are excluded from this 
plot. Experimental error rather than chemical reaction in meltwater* may be 
suspected for these samples, which were measured on the first day of 

~80 measurement days.) Note that the dry extraction values are too low in 
the transition zone from air bubbles to clathrate hydrates (for ~30—55 kyr 
ago), and the wet-extraction values are too high in the LGM® (around 20 kyr 
ago). ¢, Dome Fuji 8!8O.tm record (this study, inverted axis scale). d, Sea 
level reconstruction based on radiometric dating of fossil corals with open 
system correction”. e, Orbitally tuned 5'%O,,, (inverted axis scale) 
reconstructed from isotope records of marine sediment cores through 
regression analyses” (solid line) and ice sheet modelling*’ (dashed line). 


that obliquity and ice volume may modulate the exact phasing 
between climate and northern summer insolation''. 

The onset of the last Antarctic termination was possibly linked to 
the 19-kyr meltwater pulse, which ended the Last Glacial Maximum 
(LGM)*?>*, If the same mechanism holds for the previous termina- 
tions, the initial rapid ice volume decrease is predicted to coincide 
with or precede the onset of Antarctic terminations. A U-Th-dated 
marine core places the onset of 5'°O,,, change at ~138 kyr ago” for 
termination II, in agreement with the Antarctic timing (137.0 + 
2.2 kyr ago). Therefore, contrary to hypotheses ascribing the trigger 
of glacial terminations to CO, (ref. 14), obliquity”, or southern 
summer insolation”, our chronology implicates northern summer 
insolation as the primary trigger*!°"’. 

The major part of post-interglacial Antarctic cooling (MIS 9e-d, 
7e-d, and 5e—d transitions) also coincides with decreasing 65°N 
summer solstice insolation (Fig. 4). Comparison of the Dome Fuji 
ATyite With the CO, concentration and a radiometric-dated sea level 
reconstruction (Fig. 2d) as well as orbitally tuned 5'°O,y records 
(Fig. 2f) reveals that the Antarctic cooling for the MIS 5e-d and 
7e—d transitions began earlier by several millennia than the corres- 
ponding CO, and sea level drops (Fig. 2). This suggests that post- 
interglacial cooling began in the Northern Hemisphere with ice 
area growth and was transferred to Antarctica quickly through 
modulation of poleward heat transport and methane concentration 
decrease’—before the reduced CO, forcing, or the sea level drop 
caused by northern ice volume growth, became significant. 
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Figure 4 | Comparison of Antarctic parameters with insolation and 
obliquity around the last four terminations. Shown are Dome Fuji 3180;.. 
(black, ref. 7) and temperature AT,i,. (red, this study) on the DFO-2006 
timescale compared with 65° N summer solstice insolation (orange) and 
obliquity (blue dashed line). The Vostok temperature’ on the VkO-2006 


The phasing between the isotopic composition of atmospheric 
molecular oxygen (8'8O.tm) and northern summer insolation varies 
by +6kyr (Fig. 2), showing that our timescale is inconsistent with 
5'8Otm-orbital tuning". In addition, prior studies have used 8'°O atm 
as a proxy for 5'%O,,, and thus sea level®'*. However, the fact that 
5'8Oatm lagged the 19-kyr meltwater pulse by 4 kyr indicates a large 
change of the fractionation between atmospheric and seawater 8'°O 
(known as the Dole effect) in the interval 19-15 kyr ago, creating the 
false impression® that the onset of sea level rise lagged the Antarctic 
temperature and CO), increases. The Dole effect also drove 5'°O.tm 
change after ~7 kyr ago, when 5'*O.tm shows a clear shift to glacial- 
like values despite the stable sea level (Fig. 2). The DFO-2006 
timescale reveals another Dole-effect-forced lead of 8'°Oxtm to 
SB Owy by ~10 kyr at the MIS 5e—d transition (Fig. 2). These obser- 
vations indicate that the Dole effect involves complex glacial— 
interglacial changes, and thus orbital tuning and sea level reconstruc- 
tion based on 5'*Ont (ref. 14) are not accurate enough to assess the 
causality of glacial cycles*. Rather, 5'°O,tm should be used for recon- 
struction of the past Dole effect based on the accurate timescale, 
which is of biogeochemical interest’. 

In summary, the mean phasing of Antarctic climate, as well as the 
timing of the last four terminations and three post-interglacial cool- 
ings, are consistent with the hypothesis that high northern latitude 
summer insolation is the trigger of glacial—interglacial cycles. The 
role of CO, as conveyor and amplifier of the orbital input should 
be quantified with climate models run using our new timescale; this 
quantification is important for future climate change predictions. 
Our timescale should be validated further with new radiometric 


timescale is also plotted for termination IV (green). Numbers indicate key 
timings (minima and maxima of insolation and obliquity, and onset and end 
of 880; increases). The age tie points with 2¢ uncertainties are shown at 
the top. 


age markers, as well as by process studies for complete understanding 
of the physical link between O./N> and local insolation. With future 
O,/N> measurements, it may be possible to apply this method to the 
Dome Fuji and Dome C cores for termination V and older termina- 
tions, to investigate the phasing of climate and atmospheric com- 
position with respect to orbital forcing further back in time. 


METHODS SUMMARY 


All gas measurements on the Dome Fuji core were carried out at Tohoku 
University. Ice-core air was extracted with a melting technique® and measured 
with a mass spectrometer (Finnigan MAT Delta-S) with lo uncertainty of 0.2%o, 
0.02%o and 0.04%o for 5(O2/N3), 5'°N of Nz and 8'8O of O3, respectively. The 
O,/N, and 8'8O data were corrected for gravitational enrichment in the firn by 
subtracting 4 X 5'°N and 2 X 8°N, respectively. Because O,/N) in clathrate ice 
gradually becomes depleted over years in a warm freezer (—25 °C) owing to 
diffusive gas loss’, the data were corrected according to the storage period, 
and outliers were rejected. The new CO, data were obtained with a dry extraction 
method and gravitationally corrected. 

Dome Fuji tuning tie points were made for ~82-335 kyr ago at all maxima and 
minima of a low-pass filtered O,/N, curve and summer solstice insolation at 
77° S. The DFGT-2003 glaciological timescale’ was modified according to the tie 
points (see text). The Vostok tie points were made for ~186-354 kyr ago to 
modify the Vk-FGT1 timescale’*. Uncertainties at and between the tie points 
were estimated by Monte Carlo simulations (Supplementary Information). 

Gas age of the Dome Fuji core for CO, and 5'°O 4, was derived by subtracting 
ice age—gas age difference (Aage) from the DFO-2006 ice age, using a densifica- 
tion model. The Aage depends primarily on accumulation rate, ranging from 1.6 
to 4.7 kyr with an accuracy better than 20%. 

The Dome Fuji temperature AT, i. was reconstructed with an isotopic inver- 
sion using a low-pass filtered deuterium-excess (d-excess) record. Note that the 
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ATyite presented here is not intended to resolve millennial changes because of the 
filtering as well as averaging of AT,,. to 500-yr intervals. Future improvements to 
the ice core data and traverse snow isotope data from the coast to Dome Fuji will 
be conducted. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Data correction and selection. Because O,/N, in the clathrate ice becomes 
gradually depleted over years during sample storage in a warm freezer (for 
example, —25°C) due to diffusive gas loss*, the O,/N, data were corrected 
according to the storage period assuming a linear O,/N, depletion with time 
up to 639days (Supplementary Fig. 2). The samples stored for more than 
800 days do not appear to deplete further, thus those data were corrected by 
the same correction for 639 days. When plotted against age, the corrected O3/N, 
record shows a smoother and more regular variation than the uncorrected data 
(Supplementary Fig. 3), and it resembles the independent Vostok record’, indi- 
cating that the correction is reasonable. We note that using different fits to the 
depletion trend (exponential function or linear trend throughout all storage 
durations) result in identical timescales within 2¢ uncertainty. Because the 
post-coring fractionation of bubble ice is not currently understood’, the data 
from shallower depths (<1,200 m or younger than ~75 kyr ago) are not suitable 
for dating”’. 

After the correction, outliers in the O2/N> data were rejected on the basis of 
deviation from a low-pass filtered curve (Supplementary Fig. 4). 28 points out of 
the original 185 points were rejected with a criterion of 3.2%. The high noise 
levels for the period younger than ~230 kyr ago are attributable to errors in the 
large gas-loss corrections for the samples stored at —25 °C for a long time. Most 
of the outliers indeed consist of the longest-stored samples. The Dome Fuji 
O2/N> data can be improved in the future by measuring ice samples stored in 
a cold freezer (—50 °C; ref. 4) or at Dome Fuji. Measuring O,/N, within a few 
months after drilling would be ideal for future studies. 

Orbital tuning of O,/N, record to local summer solstice insolation. The tuning 
tie points were made between the O,/N, record and summer solstice insolation 
at 77° S by peak-to-peak matching (Fig. 1). Note that use of average insolation 
for a few months around the summer solstice as a tuning target produces ident- 
ical results within 0.2 kyr. For detecting the maxima and minima in the O3/N) 
data, as well as for the outlier rejection, the O,/N, data on a glaciological time- 
scale’? (DFGT-2003) were linearly interpolated at 0.1-kyr intervals, and then 
smoothed by a 559-order finite-duration impulse response (FIR) filter with a 
Kaiser window (sharp cut-off from 16.7 to 10.0 kyr period; pass-band ripple of 
less than 1%; attenuation at stop-band of more than 99.9%), which was designed 
with the Filter Design and Analysis Tool of the MATLAB software. To avoid 
phase modulation, the filter was applied twice in the opposite direction (filtfilt 
function in MATLAB). The peak positions in the filtered curve were used for the 
matching. We checked the filter performance by applying it to the insolation 
curve, and confirmed that the peak positions are not altered by more than 
0.1 kyr. The timings of the peaks in the filtered O2/N> curve were tied to those 
in the local summer solstice insolation. Mid-transitions were not used because 
the relationship between O2/N, and insolation may be nonlinear. Also, the use of 
mid-transition instead of peaks does not improve the precision by our method. 
The DFGT-2003 timescale was linearly interpolated between the tie points to 
derive the O2/N> timescale (DFO-2006). Two tie points at ~221 and ~230 kyr 
ago are transferred from the Vostok O,/N, data because the Dome Fuji data are 
too noisy in this part. We also transferred the tie point at ~333 kyr ago from the 
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Vostok data to better constrain the timescale in the oldest part of the current 
record. 

Ice age-gas age difference. The age of gas in the Dome Fuji core for the 5'*Oatm 
and CO, records was derived by subtracting an estimate of ice age-gas age 
difference (Aage) from the DFO-2006 ice age. The Aage was estimated by a 
semi-empirical firn densification model” after adjustment of model parameters 
to reproduce the present density profile (similar to ref. 8). The uncertainty of 
Aage is estimated by comparing Antarctic Isotope Maxima™ (AIM) in the Dome 
Fuji 5'8O;.. with the abrupt increases of CH, (ref. 31), assuming that these two 
signals are synchronous'*”’. The estimated 2¢ uncertainties are 8-18% of Aage at 
AIM 1, 2, 8, 14, 17, 23 and 24 (the sampling intervals of CH, data for other AIM 
are too low to usefully constrain the uncertainty). We thus conservatively assign 
a constant fraction of +20% as the 2¢ uncertainty of Aage (for example, 0.9 kyr 
in the LGM, 0.5 kyr for the Holocene). 

Dry extraction method for CO, measurements. The dry extraction method for 
the CO, concentration data has been described elsewhere”. Slight modifications 
were made as follows. First, the extraction apparatus was pre-conditioned by hot 
humid air to minimize contamination®. Second, a helium cycle cooler (Janis 
Research) was used instead of liquid helium for cryogenic collection of extracted 
air. Third, shorter sample tubes were used. Forth, the ice vessel was evacuated for 
90 min. 

Site temperature reconstruction. An isotopic inversion technique*”*” was used 
to correct the raw isotopic record for both vapour source temperature and 
marine isotopic composition variations. Owing to the high level of noise in 
the deuterium excess data especially between 20 and 80 kyr ago, we use a filtered 
record (passband > 5,000 yr, stopband < 2,000 yr), as we only discuss the large 
climate variations. We used a recent marine 5'*O,,, reconstruction®’. We estim- 
ate the sensitivities of isotopic values* to site (Dome Fuji) and source tempera- 
tures and marine isotopic composition by using a Rayleigh-based model. We 
tune this model with the Dome Fuji present-day surface data. Unfortunately, we 
currently lack isotopic data of surface snow from the coast to Dome Fuji to 
correctly define the air mass trajectories. Our modelling thus does not offer 
comprehensive constraints, and future improvements are necessary to interpret 
ATyite in detail. Nevertheless, the sensitivities we found are close to those 
obtained for Vostok” and Dome C* (see Supplementary Information). 
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Hadean diamonds in zircon from Jack Hills, 


Western Australia 


Martina Menneken’, Alexander A. Nemchin’, Thorsten Geisler’, Robert T. Pidgeon* & Simon A. Wilde” 


Detrital zircons more than 4 billion years old from the Jack Hills 
metasedimentary belt, Yilgarn craton, Western Australia, are the 
oldest identified fragments of the Earth’s crust’” and are unique in 
preserving information on the earliest evolution of the Earth. 
Inclusions of quartz, K-feldspar and monazite in the zircons’, in 
combination with an enrichment of light rare-earth elements*’ 
and an estimated low zircon crystallization temperature’, have 
previously been used as evidence for early recycling of continental 
crust, leading to the production of granitic melts in the Hadean 
era. Here we present the discovery of microdiamond inclusions in 
Jack Hills zircons with an age range from 3,058 + 7 to 4,252 +7 
million years. These include the oldest known diamonds found in 
terrestrial rocks, and introduce a new dimension to the debate on 
the origin of these zircons and the evolution of the early Earth®"°. 
The spread of ages indicates that either conditions required for 
diamond formation were repeated several times during early Earth 
history or that there was significant recycling of ancient diamond. 
Mineralogical features of the Jack Hills diamonds—such as 
their occurrence in zircon, their association with graphite and 
their Raman spectroscopic characteristics—resemble those of 
diamonds formed during ultrahigh-pressure metamorphism 
and, unless conditions on the early Earth were unique, imply a 
relatively thick continental lithosphere and crust—mantle inter- 
action at least 4,250 million years ago. 

Research on >3,900-Myr-old zircon grains from the Jack Hills has 
mainly focused on their chemical and isotopic compositions. In this 
study, we have investigated the mineral inclusions in 1,000 randomly 
chosen zircon grains, collected from a sample of the Jack Hills 
conglomerate from the initial ‘discovery’ site’. U-Pb ages of the zircon 
grains were determined using a sensitive high-resolution ion micro- 
probe (SHRIMP II), following the analytical procedure described in 
detail elsewhere'’’*. Optical examination revealed that about half of 
the zircon population contains inclusions larger than ~1 um. These 
inclusions were further investigated by confocal micro-Raman spec- 
troscopy. A Raman spectrum of a mineral reflects both its structural 
and chemical properties and can thus be used as a fingerprint for its 
identification. We have identified apatite, quartz, xenotime, monazite, 
rutile, biotite, amphibole, K-feldspar and plagioclase inclusions in 
the Jack Hills zircon, some of which have also been recognized in 
previous studies**. In addition, we unambiguously identified dia- 
mond inclusions in 45 zircon grains, mostly associated with graphite, 
but in some cases with apatite and quartz. 

The Raman spectrum of diamond is characterized by a single first- 
order band near 1,332 cm ' with a typical width (full-width at half- 
maximum, FWHM) of ~1.7 cm‘, reflecting a triply degenerate C-C 
stretching vibration with F,, symmetry, and sometimes by a second- 
order feature near 2,600cm | (ref. 13). The Raman spectra of most 
of the diamond inclusions (‘diamond 1’) in the Jack Hills zircons 


display the characteristic diamond band at a frequency between 1,329 
and 1,332cm ' (Fig. 1a). None of the diamonds show a second- 
order spectral feature near 2,600 cm” '. Some inclusions (‘diamond 
2’), however, show an intense band between 1,320 and 1,327 cm ! 
(Fig. 1b) that is further characterized by a larger width compared with 
that of diamond 1 (Fig. 2). Despite the downshift and broadening, the 
bands are still symmetric (Fig. lc, d). At the margin of some diamond 
inclusions, we obtained spectra that show bands of diamond 1 and 2 
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Figure 1| Representative Raman spectra of diamond and graphite 
inclusions in zircon from Jack Hills. a, Raman spectrum showing the first- 
order Raman band of diamond near 1,332 cm‘ witha relatively small band 
width (diamond 1). b, Representative spectrum of diamond inclusions that 
are characterized by a broadened band located between 1,320 and 

1,327 cm | (diamond 2) ¢, Raman spectra of a diamond inclusion that is 
characterized by two bands, representing diamond 1 and 2, measured with 
different laser intensities. The 100% spectrum was measured with full laser 
power. Note the blue shift of the left band with decreasing laser power, which 
demonstrates sample heating effects and indicates that diamond 2 is 
polycrystalline diamond of variable crystallite sizes. d, Raman spectrum of 
graphite with the broad first-order band (G band) near 1,580cm™' anda 
relative weak disorder band (D1 band) near 1,350cm ' (for a 532nm 
excitation). e, Spectrum of a diamond-graphite composite inclusion. Note 
that the graphite bands are sharper than those shown in c and that an 
additional disorder band near 1,620cm_! is visible (D2 band). 
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overlapping each other. In contrast to the characteristic band of 
diamond 1, the band of diamond 2 shifts towards higher wave- 
numbers, when decreasing the laser power (Fig. 1c), indicating 
heating effects induced in diamond 2 by the incident laser. It has 
been shown" that heating effects in diamond powders of grain sizes 
down to 0.1 lum can cause a size-dependent frequency shift towards 
1,320cm7' that is associated with symmetric band broadening, as a 
result of different thermal properties of microcrystalline diamond. 
Hence, the band width—frequency relationship of diamond 2, seen in 
Fig. 2, reflects the occurrence of polycrystalline diamond of variable 
grain size that is associated with larger diamonds in some inclusions 
(Fig. 1c). 

A major Raman-active vibration near 1,326 cm” ' and an additional 
band near 1,175cm ' (ref. 15), characteristic of lonsdaleite (a hexa- 
gonal polytype of diamond, which has been reported from impact 
structures'®), were not observed in the present study (Fig. 1). 
However, we have identified graphite associated with most diamond 
inclusions by its broad first-order Raman band near 1,580 cm ! (G 
band’) and in some cases by its disorder band (‘D1 band’) near 
1,350cm ‘ (for a 532nm excitation) and its second disorder band 
(‘D2 band’) near 1,620 cm! (Fig. 1d and e)!”. 

The diamond inclusions have rounded to hexagonal, oval (Fig. 3), 
angular, or even needle-like shapes. Commonly, graphite is found 
surrounding diamond, suggesting that it formed after the diamond as 
a result of retrograde transformation. Diamonds that are exposed at 
the polished surfaces of the grains can be recognized by their bright 
cathodoluminescence (CL) intensity, distinguishing them from 
exposed graphite which gives no CL signal (Figs 3a and d). The 
secondary electron image in Fig. 3e shows a relatively large, exposed 
diamond aggregate (giving the spectrum shown in Fig. 1a), which 
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Figure 2 | Raman data from diamond inclusions in the Jack Hill zircons, 
from diamond polishing particles, and from diamonds of different 
geological origin. Shown is the width (given as FWHM and corrected for 
instrumental broadening) of the first-order diamond Raman band as a 
function of frequency for diamond inclusions in zircon crystals from Jack 
Hills and for diamond polishing particles. The line broadening and 
frequency red-shift of diamond 2 reflects sample heating effects as a result of 
variable grain sizes, as marked by the arrow (see Fig. 1). Note that diamonds 
with different Raman spectral features (diamond 1 and 2) occur within all 
age groups of the detrital Jack Hills zircon population. Age groups are 
defined using the corresponding *°’Pb/*™°Pb zircon ages (see Supplementary 
Table 1). Published Raman data*>*®***! and our own measurements (given 
in Supplementary Table 2) of diamonds of different origin (shown as 
coloured fields) are included for comparison. 
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was originally surrounded by graphite that has now been polished 
away. 

The presence of diamond inclusions in zircon raises the question 
of whether these diamonds could be the result of contamination from 
the De Beers diamond polishing powder used in preparing the zircon 
mounts. However, this can be ruled out as (1) some diamond inclu- 
sions are completely enclosed by zircon (Fig. 3b and c), (2) most 
diamonds reveal variable spectral features that clearly distinguish 
them from those of the diamond particles in the polishing powder 
(Fig. 2), (3) the size of the diamond inclusions ranges between 6 and 
70 um so that they are mostly larger than the diamond particles in the 
polishing materials (<12 um), and (4) almost all of the diamonds are 
associated with graphite (Fig. 1d), which is not present in the polish- 
ing powder. 

The diamond-bearing zircon crystals are characterized by highly 
variable internal structures in CL images, typical of the whole Jack 
Hills population®'’. These include simple oscillatory growth zones 
(Fig. 3d) that are typical of zircon crystals grown from a melt or 
fluid phase, and irregularly curved growth zones (Fig. 3a) that indi- 
cate zircon re-equilibration in an aqueous fluid or melt'’. Those 
zones containing diamond show a distribution of *°’Pb/?°°Pb ages 
similar to the average statistical age distribution of the Jack Hills 
zircon population®”’ with a pronounced peak between 3,200 and 
3,400 Myr and with ~10% of the population being older than 
3,900 Myr (Supplementary Table 1). We found no correlation 
between the presence of diamond and the age, the internal growth 
structure, or the Th/U of the host zircon crystals (Supplementary 
Table 1). Moreover, the polycrystalline microdiamonds (diamond 
2) occur in all age groups (Fig. 2). The three oldest diamond-bearing 
zircon grains gave concordant ages (Myr) of 4,096+4 (lo), 
4,132 + 5 (1o) and 4,252 + 7 (1o) (Fig. 3d), making them the oldest 
known diamonds found in terrestrial rocks. The previously known 
oldest terrestrial diamonds, with a maximum age defined by garnet 
inclusions at 3,300 = 200 Myr, are diamonds found in 90-Myr-old 
kimberlites from Finsch and Kimberley in South Africa”’. 

To evaluate the mode of formation of the Jack Hills diamonds, 
it is useful to compare the observed mineralogical features with those 
from diamonds formed under known geological conditions. Natural 
diamonds form under pressures of >3.2 GPa and temperatures 
between ~650 and 1,200 °C in the Earth’s interior at a depth of 
>100 km or during meteoritic impact events”. Until now, diamonds 
have been found in (1) primitive carbonaceous chondrites, (2) me- 
teoritic impact structures, (3) kimberlites and lamproites, which 
delivered them from the subcontinental lithosphere, (4) komatiites, 
and (5) in crustal rocks, typically sedimentary, that have been sub- 
jected to ultrahigh-pressure (UHP) metamorphic conditions”. 

UHP metamorphic terranes are at present the only known setting 
where microdiamonds, about 3—50 tm in size, have been reported as 
inclusions in zircon’**. Also, secondary graphite is commonly 
associated with diamonds that formed in UHP terranes**’” owing 
to the relatively slow exhumation rates compared to the rate of kim- 
berlite magma ascent, which allows the transformation of diamond 
to graphite. We further note that the Raman spectral features of 
diamond 1 agree well with those reported for UHP diamonds**”>**, 
and are readily distinguished from those of shock-related diamonds 
and from diamonds found in kimberlites and lamproites (Fig. 2). The 
latter diamonds are also characterized by a second-order Raman 
band near 2,600cm7' that has not been observed in the present 
study. 

So far we have not detected high-pressure minerals other than 
diamond in the Jack Hills zircon crystals. There is also no evidence 
that the observed quartz inclusions were derived from its high- 
pressure polymorph coesite. The overall mineral inclusion assem- 
blage is essentially granodioritic to granitic (Supplementary Table 1), 
which resembles the mineral paragenesis commonly associated 
with carbonados. Carbonados are large polycrystalline nodules 
that consist of euhedral diamond crystals (typically up to 200 um) 
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Figure 3 | Mineralogical features of diamond inclusions in Jack Hills zircon 
samples. a—c, Grain JH15-142; d-f, grain JH3-20. a, Cathodoluminescence 
(CL) image of grain JH15-142. Gr, graphite. b, Backscattered electron (BSE) 
image of the area around the graphite-diamond composite inclusion shown 
in a. The bright BSE intensity inner region where the inclusion is exposed to 
the surface is from relict gold coating covering graphite. The dimension of 
the inclusion, as visible in the transmitted light image shown in ¢, is outlined 
by the grey curve. c, Transmitted light image of the same inclusion as shown 
in Fig. 1b. The part of the inclusion that is exposed to the surface (see b) is 
outlined by the grey curve. The coloured contours in overlay show the 


set in a matrix of microcrystalline diamond and are associated with 
crustal minerals such as quartz, orthoclase, xenotime and zircon”. 
Their origin is enigmatic, but theories include metamorphism of 
earliest subducted lithosphere, radioactive transformation of mantle 
hydrocarbons, and meteoritic impact on a concentrated biomass”’. 
However, the Raman spectral characteristics of carbonados that we 
have measured are more similar to those of kimberlite- or lamproite- 
derived diamond rather than to those found in Jack Hills zircon 
(Fig. 2). 

The absence of high-pressure minerals except diamond, together 
with the occurrence of diamonds in zircons with a range of primary 
and secondary internal structures (Fig. 3) and the highly variable Th/ 
U ratios (Supplementary Table 1), which do not support formation 
of zircon at UHP metamorphic conditions, presents a significant 
problem for the interpretation of the origin of the diamonds and 
the crystallization history of the zircons. These observations suggest 
that the diamonds and at least some of their host zircons were formed 
under different pressure-temperature conditions. It is possible that 
the diamonds were formed during a single event before or at 4,252 + 
7 Myr ago, which is the crystallization age of the oldest diamond- 
bearing zircon from the Jack Hills conglomerate (Fig. 3d). This 
hypothesis may be supported by the Hf-isotope compositions pre- 
viously determined for Jack Hills zircon”*’, which were interpreted to 
reflect extensive reworking of material separated from the mantle 
before about 4,140 Myr ago*’. Diamonds found in younger zircon 
grains would then be a result of recycling of the older material. 
Alternatively, diamonds could have been incorporated in zircon at 
several periods in early Earth history by repetition of the process that 
resulted in diamond formation. 

Many questions remain regarding the origin of the diamonds and 
their incorporation in the Jack Hills zircons. However, the compar- 
ison of the observed mineralogical features of these diamonds with 
those formed under known geological conditions, particularly their 
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intensity distribution of the first-order Raman band of diamond near 
1,332 cm _!. Note that the diamond is included inside the zircon. Qtz, quartz. 
d, CL image of grain JH3-20. Dia, diamond. e, Secondary electron image of 
inclusion 20-3, which is visible in the right corner of the zircon grain shown 
in d. f, Carbon distribution of inclusion 20-3 as obtained by energy- 
dispersive X-ray analysis. Note that carbon signals at the border of the 
inclusion result from remnant graphite. White ellipses in a and d mark the 
SHRIMP analytical sites; numbers show the 7°’Pb/*”°Pb ages (in Myr) with 
the lo uncertainty. Intensity variations in ¢ and f scale from blue (low 
intensity) to red (high intensity). 


Raman spectral characteristics and their occurrence with graphite in 
zircon, indicates that they share more similarities with UHP meta- 
morphic diamonds than with any other known type. If correct, and 
unless unknown processes operated during the Earth’s earliest his- 
tory, this would imply the presence of a relatively thick continental 
lithosphere and that crust—-mantle interaction occurred on Earth as 
early as 4,250 Myr ago. 


METHODS SUMMARY 

Raman spectroscopy. Laser-Raman spectra were collected with a Jobin Yvon 
HR800 dispersive Raman spectrometer using the 532 nm line of a 14mW Nd- 
YAG laser. The scattered Raman light was analysed with a 100 X objective in 180° 
backscatter geometry and a charged-coupled device (CCD) detector after being 
dispersed by a grating of 600 grooves mm |. The diamond inclusions were re- 
measured for quantitative determination of the position and width of the first- 
order diamond band with a spectral resolution of 1.1 cm ! (near 1,330cm7!) 
using a grating of 1,800 grooves mm '. 

Raman images of the diamond-graphite-(quartz) composite inclusion 

(Fig. 3c) as well as measurements of the diamond powder were performed with 
an ISA LabRam dispersive Raman spectrometer using the 632.187 nm line of a 
He-Ne laser and a grating of 1,800 grooves mm‘, yielding a spectral resolution 
of 16cm. 
Sensitive high-resolution ion microprobe (SHRIMP). The filtered (O2 ) beam 
with intensity between 2 and 3nA was focused on the surface of samples into 
~20 um spot. Secondary ions were passed to the mass spectrometer operating at 
a mass resolution (M/AM) of ~5,000. Each analysis was preceded by a 2 min 
raster to remove the Au coating. The peak-hopping data collection routine 
consisted of five scans through the mass stations, with signals measured by an 
ion counting electron multiplier. Pb/U ratios were calibrated using an empirical 
correlation between Pb*/U* and UO*/U* ratios, normalized to the Curtin 
University standard (a 564-Myr-old Sri Lankan zircon). The 0.8-1.6% error 
obtained from the multiple analyses of Pb/U ratio on the standard during indi- 
vidual SHRIMP sessions was added in quadrature to the errors observed in the 
unknowns. 


919 


©2007 Nature Publishing Group 


LETTERS 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Sample preparation. Zircon grains (1,000) were hand-picked from a previously 
prepared >135 um concentrate and mounted onto double-sided adhesive tape, 
along with pieces of the Curtin University Sri Lankan gem zircon (CZ3) used as 
standard for the SHRIMP measurements. They were enclosed in epoxy resin 
disks, ground so as to effectively cut all zircon grains in half, and polished with 
<12 um De Beers diamond polishing powder. 

Raman spectroscopy. Laser-Raman spectra of the inclusions were collected with 
a Jobin Yvon HR800 dispersive Raman spectrometer using the 532 nm line of a 
Nd-YAG laser with a laser power of 14mW. The scattered Raman light was 
analysed with a CCD detector after being dispersed by a grating of 600 groo- 
vesmm |. A 100X objective with a numerical aperture of 0.9 was used on a BX- 
40 microscope. The diamond inclusions were re-measured with a grating of 
1,800 grooves mm ' for quantitative determination of the position and width 
of the first-order diamond band near 1,332 cm !. With the 1,800 groovesmm 7 
grid, the spectral resolution near 1,332 cm! was 1.1 cm7!, as measured by neon 
emission lines. The following equation was used to correct for the effect of the 
finite slit width on measured band width (given as FWHM)”: 


P=T,(1- (S/Pn)"] 


where I’, J\, and Sare the corrected band width of the first-order diamond band, 
the measured band width, and the spectral slit width, respectively. The frequency 
was calibrated using the first-order Si line at 520.7cm | and lines from a neon 
lamp. The standard errors from the fitting procedure of both the Raman fre- 
quency and width were usually less than 0.2cm7'. 

We checked for possible heating effects induced by the incident laser power on 
the Raman spectral features. Larger diamond inclusions (diamond 1) do not 
show any heating effects as checked by (1) reducing the laser power with filters 
and (2) measuring the integral intensity of the Stokes and anti-Stokes compon- 
ent in the diamond Raman spectra, which allows the calculation of the local 
heating temperature" (see Fig. 1c) However, some diamonds (diamond 2) show 
temperature-induced spectral changes, which are used to discriminate between 
the two types of diamond (see Fig. 1c and Fig. 2). 

Raman images of the diamond-graphite-(quartz) composite inclusions as well 

as measurements of the diamond powder were performed with an ISA LabRam 
dispersive Raman spectrometer using the 632.187 nm line of a He-Ne laser and a 
grating of 1,800 grooves mm’ '. Raman images were obtained after the SHRIMP 
measurements on re-polished samples. The spectral resolution was 1.6cm™! 
near 1,332cm '. The lateral spatial resolution was about 1-2 um. For imaging, 
a confocal hole of 500 tm was used. 
Sensitive high-resolution ion microprobe (SHRIMP). The SHRIMP meth- 
odology follows analytical procedure described elsewhere'’'?. The filtered 
(O2 ) beam with intensity between 2 and 3nA was focused on the surface 
of samples into an ~20m spot. Secondary ions were passed to the mass 
spectrometer operating at a mass resolution (M/AM) of ~5,000. Each analysis 
was preceded by a 2 min raster to remove the Au coating. The peak-hopping 
data collection routine consisted of five scans through the mass stations, 
with signals measured by an ion counting electron multiplier. Pb/U ratios 
were calibrated using an empirical correlation between Pb*/U* and UO*/U* 
ratios, normalized to the 564-Myr-old Sri Lankan zircon CZ3°*’. The 0.8-1.6% 
error obtained from the multiple analyses of Pb/U ratio on the standard 
during individual SHRIMP sessions was added in quadrature to the errors 
observed in the unknowns. The initial data reduction was done using the 
SQUID add-in for Microsoft Excel**, and Isoplot** was applied for further age 
calculations. 
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A new species of great ape from the late 


Miocene epoch in Ethiopia 


Gen Suwa’, Reiko T. Kono’, Shigehiro Katoh®, Berhane Asfaw* & Yonas Beyene” 


With the discovery of Ardipithecus, Orrorin and Sahelanthropus'”, 
our knowledge of hominid evolution before the emergence of 
Pliocene species of Australopithecus*” has significantly increased, 
extending the hominid fossil record back to at least 6 million years 
(Myr) ago. However, because of the dearth of fossil hominoid 
remains in sub-Saharan Africa spanning the period 12-7 Myr ago, 
nothing is known of the actual timing and mode of divergence of 
the African ape and hominid lineages. Most genomic-based studies 
suggest a late divergence date—5—6 Myr ago and 6-8 Myr ago for 
the human-chimp and human-gorilla splits, respectively'° *—and 
some palaeontological and molecular analyses hypothesize a 
Eurasian origin of the African ape and hominid clade’*”*. We report 
here the discovery and recognition of a new species of great ape, 
Chororapithecus abyssinicus, from the 10—10.5-Myr-old deposits of 
the Chorora Formation at the southern margin of the Afar rift. To 
the best of our knowledge, these are the first fossils of a large- 
bodied Miocene ape from the African continent north of Kenya. 
They exhibit a gorilla-sized dentition that combines distinct 
shearing crests with thick enamel on its ‘functional’ side cusps. 
Visualization of the enamel—dentine junction by micro-computed 
tomography reveals shearing crest features that partly resemble the 
modern gorilla condition. These features represent genetically 
based structural modifications probably associated with an initial 
adaptation to a comparatively fibrous diet. The relatively flat cuspal 
enamel-dentine junction and thick enamel, however, suggest a 
concurrent adaptation to hard and/or abrasive food items. The 
combined evidence suggests that Chororapithecus may be a basal 
member of the gorilla clade, and that the latter exhibited some 
amount of adaptive and phyletic diversity at around 10-11 Myr ago. 

The Chorora Formation represents the earliest known record of 
sedimentation within the Afar rift of Ethiopia'’. Perilacustrine sedi- 
ments and their palaeontological contents were first reported in the 
1970s (ref. 18) and estimated to be between 10 and 11 Myr old’?”*. 
More recent re-evaluations have confirmed this chronology from 
both radioisotopic dating and biochronological considerations’. 
Before now, the Chorora Formation large mammal fauna was known 
from a fragmentary fossil assemblage recovered from a single locality, 
the ‘type site’, and included a primitive Stegotetrabelodon proboscid- 
ean, a large giraffid, a primitive hippopotamid, and a hipparionine 
equid'”'*. The latter is the most abundant taxon represented, and is 
considered the earliest known record of an equid in Africa with a best 
age-estimate of 10.7—10.1 Myr old”'. 

The Chorora fossils have been seen as a key assemblage for addres- 
sing both within-African and intercontinental mammalian evolu- 
tionary patterns*'. This is because of Chorora’s intermediate 
geographic position between sub-Saharan Africa and Eurasia, as well 
as the paucity of African fauna spanning the time periods 12—7 Myr 


ago. Despite such expectations, the fragmentary assemblages hitherto 
known have not realized the site’s potential. 

We report here the first primate fossils recovered from the Chorora 
Formation. These were discovered during newly initiated systematic 
surveys from 2005 through to 2007 that aimed to locate palaeonto- 
logical localities other than the relatively well-researched Chorora 
Formation type site. Primate remains were recovered at the Beticha 
locality, 3km from the type site exposures, and include a macaque- 
sized cercopithecoid monkey (represented by a partial phalanx) and 
teeth of a large bodied hominoid primate. The primate fossils derive 
from the upper portions of the Chorora Formation, higher in the 
section than the type site assemblage. Their age is best considered 
as approximately 10.0 to 10.5 Myr old (Fig. 1, and Supplementary 
Information). 

The large-bodied ape is represented by a canine and eight partial 
molars from at least three, and perhaps six or more, individuals 
(Fig. 1, and Supplementary Information). These teeth are collectively 
indistinguishable from modern gorilla subspecies in dental size and 
represented proportions (Supplementary Information). This modest 
sample nevertheless exhibits substantial size variation, with molars at 
both the largest and smallest end of the modern gorilla ranges of 
variation. The species also exhibits a suite of subtle but derived char- 
acteristic molar features that are shared with modern gorillas (see 
below). Functionally, they indicate incipient morphological speciali- 
zation towards a relatively fibrous diet. Phylogenetically, these fossils 
represent the first Miocene ape species to be recognized as a strong 
candidate for membership in the modern gorilla clade. 


Order Primates 
Superfamily Hominoidea Gray, 1825 
Chororapithecus gen. nov. 
Chororapithecus abyssinicus sp. nov. 


Etymology. The genus name is taken after Chorora, the name of the 
geological formation from which the fossils are derived; Chorora is 
the name of a local village approximately 8 km south of the Beticha 
locality. The species name is taken from Abyssinia, the former name 
of Ethiopia, in light of the geographical location of discovery at the 
junction area of the Main Ethiopian and Afar rift systems. 
Holotype. CHO-BT 4 right upper M’ (second molar). 

Paratype. CHO-BT 3, left lower canine; CHO-BT 5, right upper M’; 
CHO-BT 6, right upper M’; CHO-BT 7, left lower M,; CHO-BT 8, 
left lower M,; CHO-BT 9, left lower molar fragment; CHO-BT 10, 
right lower molar fragment; and CHO-BT 11, right upper M°. 
Diagnosis. A hominoid primate with molars of equivalent size to 
those of Gorilla gorilla subspecies that combine a tendency for 
enhanced shearing structures (relatively long mesial protocone crest 
of the upper molars and high trigonid crest of the lower molars) and 
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Figure 1| Chororapithecus abyssinicus fossils and schematic stratigraphy. 
a, CHO-BT 4, mesiodistal dimension 19.1 mm, buccolingual dimension 
18.0 mm. b, CHO-BT 5, mesiodistal dimension 16.9 mm, buccolingual 
dimension 17.5 mm. c, CHO-BT 6, mesiodistal dimension 13.2 mm, 
buccolingual dimension 15.0 mm. d, CHO-BT 7, mesiodistal dimension 
19.1 mm, buccolingual dimension 15.9 mm. e, CHO-BT 8, buccolingual 


relatively low cuspal topography with thick enamel at the ‘functional’ 
side cusps (lingual in upper and buccal in lower molars). The upper 
molars tend to be buccolingually narrow and mesiodistally elongate 
in shape, with a mesiobuccally extending mesial protocone crest. The 
epicrista is thin and buccolingually straight, creating a narrow mesial 
fovea located buccally. The high and continuous trigonid crest of the 
lower molars is situated distinctly above the level of a low mesial 
marginal ridge. The metaconid is moderately tall, forming a deep 
lingual notch in comparison with other large-bodied Miocene homi- 
noids. In both upper and lower molars, cusp tips are relatively peri- 
pherally placed and cingular expressions are weak. 


Figure 2 | Micro-CT-based morphological evaluations of the outer enamel 
surface and enamel-dentine junction. Three-dimensional rendered views of 
right upper M’ and M’ (a), and left lower Mz and M; (b). ¢, Lower molars 
mirror-imaged and placed in an occlusal relationship with the upper molars; 
lingual view, mesial to the left. Note the shearing relationship between 
mesiolingual protocone and the distal slope of the metaconid. Mesial view of 
Chororapithecus (d) and Gorilla (e) lower Ms, with the mesial marginal ridge 
broken or cut off, showing a prominent trigonid crest. Natural section of 
Chororapithecus lower M, metaconid break (mirror-imaged, f), and 
equivalent micro-CT section of a Gorilla molar (g). h, Chororapithecus lower 
M, longitudinal section at mid-protoconid, its position as indicated in j, and 
equivalent section of a Gorilla molar (i). Note the prominent trigonid crest of 
the ED] (left arrows) located distinctly above the level of the mesial marginal 
ridge (right arrowheads). Micro-CT visualization of EDJ surfaces of 
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dimension 13.2 mm. f, CHO-BT 9. g, CHO-BT 10. h, CHO-BT 11. i-k, CHO- 
BT 3 buccal, distal and mesial views, respectively, maximum diameter 

13.9 mm, perpendicular diameter 10.7 mm. All dimensions except for those 
of CHO-BT 3 are estimated values corrected for minor damage. Simplified 
stratigraphy (type site section after ref. 20, radioisotopic age determinations 
from refs 17 and 20; see Supplementary Information for details). 


Differential diagnosis. Chororapithecus abyssinicus differs from 
Gorilla gorilla subspecies by having molar crown and cusps lower 
in height and with thicker enamel; upper molars with less distinct 
crista obliqua; and lower molars lacking buccolingual infolding of 
the buccal occlusal margin (that is, a lack of extremely enlongated 
shearing crests). C. abyssinicus differs from all known modern and 
fossil hominoids (including the nearly contemporaneous African 
genus Samburupithecus), with the exception of Gorilla, Rangwapi- 
thecus, Nyanzapithecus and Oreopithecus, in enhanced shear between 
the mesial protocone crest and the distal occlusal slope of the meta- 
conid. Furthermore, C. abyssinicus is different from Samburupithecus 


MCP versus PAPR 


0.10 0.20 
MEX versus BL 


Chororapithecus (k, m) and Gorilla (1) upper M’s. Occlusal view of 
Chororapithecus upper M” ED] (k), showing the metrics evaluated in the right 
lower scatter plot. MCP-versus-PAPR is the distance between protocone tip 
and protoconule (or equivalent point) divided by paracone—protocone cusp 
tip distance, both measured on the EDJ. MEX-versus-BL is the projected 
length of the mesial fovea (measured from the paracone—protocone plane) 
divided by buccolingual crown breadth. Filled red squares, Chororapithecus; 
open squares, Gorilla; open diamonds, Pongo; open circles, P. troglodytes; 
filled circles, P. paniscus; filled triangles, Australopithecus africanus (pointing 
up) and Australopithecus robustus (pointing down). See the Supplementary 
Information for methodological details and comparative sample 
compositions. Note the linear to slightly concave mesial protocone crest of 
CHO-BT 4 resembling the Gorilla condition but with lower crown and cusp 
topography. Scale bar, 1 cm (common to all images). 
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because it has molars with a more open occlusal basin, weak expres- 
sion of the anterolingual cingular complex, and a continuous crista 
obliqua (of M’), as opposed to a crest incised by the longitudinal 
groove. C. abyssinicus differs from Proconsul major, P. nyanzae 
and ‘Ugandapithecus gitongai* in its larger size, buccolingually 
narrower upper molars, more peripherally located molar cusps, 
resulting in a more open occlusal basin, thin and weak cingular 
expressions, and lack of tendency for a strong expression of the 
protoconule. Also, C. abyssinicus differs from Ouranopithecus in 
having a larger upper M* with a more mesiodistally elongate shape, 
better developed occlusal crests, and lack of lower M3 posterior 
cusp development (hypoconulid and cusp 6 regions small); from 
Ankarapithecus and Sivapithecus parvada in its larger size, buccolin- 
gually narrower upper molars, and lack of tendency for a strong 
expression of the protoconule; and from Gigantopithecus in being 
smaller in size and possessing lower crowned molars, buccolingually 
narrow upper molars, better developed occlusal crests, and a lower 
canine with interlocking relationship with the upper canine. Other 
comparative remarks are given below and in the Supplementary 
Information. 

The most distinctive features of the Chororapithecus dentition are 
the derived shearing structures seen in portions of its molars (Fig. 2), 
despite a generally low cuspal topography (the latter is apparently a 
primitive retention). 

Examination of internal morphology by micro-computed tomo- 
graphy (micro-CT) demonstrates that these occlusal features were 
underlain by distinct enamel—dentine junction (EDJ) structure 
(Fig. 2). In particular, the straight to weakly concave mesial protocone 
crest seen in the EDJ of CHO-BT 4, -BT 5 and -BT 6 is gorilla-like, and 
is formed by a mesiobucally located junction of the mesial protocone 
crest and mesial marginal ridge. Such spatial placements are best 
considered to be regulated by enamel-knot-related signalling patterns 
during early morphogenesis***, and may be one of the underlying 
causes of the mesiodistally elongate upper molar shape generally char- 
acteristic of folivorous primate species. In the lower molars, the most 
distinctive EDJ topography occurs at the trigonid crest, the structural 
counterpart that occludes with the upper molar mesial protocone 
crest. The high trigonid EDJ crest is continuous between the meta- 
conid and protoconid cusp tips (Fig. 2). Because recent experimental 
and quantitative genetic studies suggest significant degrees of mor- 
phogenetic independence between corresponding upper and lower 
molar structures*””*, the presence of a functionally integral inter-jaw 
pattern of morphological expression, as seen in the Chororapithecus 
molars, suggests adaptation by natural selection, as opposed to chance 
emergence of neutral morphological minutia. 

For the features in which Chororapithecus differs from gorillas, the 
gorilla condition is clearly the more derived towards advanced molar 
shear. In particular, Chororapithecus exhibits a much lower cusp relief 
and thicker molar enamel (Fig. 3). Maximum radial lateral thickness 
of the ‘functional’ side cusps (which form the main occlusal areas for 
crushing and grinding) of Chororapithecus appears to scale isometri- 
cally to that of Australopithecus, whereas enamel thickness of the 
opposite side cusps is equivalent to the intermediate conditions of 
Pongo and Ardipithecus. Enamel thickness is thought to probably 
have a significant developmental component directly related to 
steepness of EDJ topography”. Scaled by basal crown area, enamel 
volume of Chororapithecus is intermediate between the Gorilla and 
Australopithecus conditions (Fig. 3). Thus, the higher crown and 
larger EDJ surface area of the gorilla molar would in part explain 
its thinner enamel coverage. Although the gorilla condition could 
potentially be attained from a thicker-enamelled ancestor, it is dif- 
ficult to evaluate whether Chororapithecus itself could represent such 
an ancestral condition. This is because descriptions of EDJ topo- 
graphy and enamel thickness patterns among Miocene hominoids 
are limited. However, the thick enamel of the Chororapithecus ‘func- 
tional’ side cusps, and the extremely low EDJ topography seen in one 
of the upper molars (CHO-BT 4) indicate that Chororapithecus itself 
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is probably too specialized to represent a direct ancestral condition of 
the modern gorilla. 

Thus, we consider Chororapithecus to exhibit a derived molar mor- 
phology indicative of incipient adaptation to a comparatively fibrous 
diet, as indicated by the emphasis of shear between the mesial proto- 
cone crest and the distal slope of the metaconid. At the same time, the 
low relief of the ‘functional’ side cusps endowed with thick enamel 
enable prolonged retention of an occlusal platform of rigid enamel 
(without exposing the more compliant dentine). This is reminiscent 
of three-dimensional enamel distribution patterns seen in Pongo and 
may be an adaptation for consuming hard and brittle food items”. 
With such a unique combination of molar structures, the shearing 
crests of Chororapithecus may have functioned in the folding and 
pulverizing of fibrous foodstuff, rather than the actual cutting of 
fibrous materials by sharp, shearing crests, as in advanced folivores 
with higher cusps. So far, among the diversity of known African and 
Eurasian middle- to large-bodied Miocene apes, with the exception of 
Oreopithecus (which is even more uniquely specialized), we are not 
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Figure 3 | Micro-CT-based evaluations of upper molar enamel thickness. 
Bivariate plots of In(maximum radial enamel thickness) of the lateral crown 
faces of lingual ‘functional’ side cusps (a), and buccal ‘non-functional’ side 
cusps (b) by In(crown area). The isometric line going through the mid-point 
of the Ardipithecus ramidus molars’ (n = 2, crosses) is drawn for reference. 
Other symbols are as in Fig. 2. Reliable micro-CT-based estimates were 
obtained for three Chororapithecus upper molars as follows. CHO-BT 4: 
paracone, 1.64 mm; protocone, 2.53 mm; hypocone, 2.62 mm; CHO-BT 5: 
paracone, 2.32 mm; metacone, 2.28 mm; protocone, 3.28 mm; hypocone, 
3.00 mm; CHO-BT 6: paracone, 1.62 mm, protocone, 2.41 mm. 
Chororapithecus shows a unique combination of thick enamel on the 
‘functional’ side cusps, and intermediate thickness on the ‘non-functional’ 
side cusps. ¢, Log bivariate plot of AETMA (enamel volume/crown area) by 
crown area. Isometric line through the CHO-BT 4 estimate is shown for 
reference. The three crosses for CHO-BT 4 indicate the raw uncorrected 
enamel volume (lowest cross), correction for macro-damage (middle cross), 
and an additional 5% increase estimated for weathering and minor wear loss 
(upper cross). The degree of this correction in a representative section is 
depicted in d. Yellow, enamel; red, restored enamel crown. Other symbols 
are as in the previous plots. Note that with this measure of average enamel 
thickness, gorillas have enamel as ‘thick’ as the other apes; that is, despite the 
thinner enamel of the gorilla, total enamel volume available per crown area is 
comparable with other modern ape species, indicating broad equivalence in 
durability against abrasion. Note that Chororapithecus is intermediate 
between apes and Australopithecus. See Supplementary Information for the 
details of methodology and sample compositions. 
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aware of a single instance of a coherent set of derived features shared 
with the modern gorilla like those seen in Chororapithecus. 

The similarities seen between the two genera raise the possibility 
that Chororapithecus is a Miocene member of the Gorilla clade. 
Alternatively, with its combination of thick enamel and distinct 
molar cresting pattern, Chororapithecus may represent a unique 
adaptation that is convergent with gorillas in molar structure and 
function. Although the evidence for phylogenetic affinity between 
Chororapithecus and Gorilla is inconclusive, it may be that the basal 
members of the gorilla clade shared large tooth size and incipiently 
enhanced molar shear as a part of an herbivorous diet that accom- 
panied (presumed) larger body size. Chororapithecus may then 
represent one example of adaptational (and perhaps phyletic) differ- 
entiation within that clade. 

Acceptance of Chororapithecus as a basal member of the gorilla 
clade would push back the gorilla species split to >10.5 Myr ago. 
Because this is a minimum date established from a meagre fossil 
record, the actual divergence would have predated this by an 
unknown time gap. From the currently available evidence, we con- 
sider that a species split of ~20 Myr ago for Pongo, 12 Myr ago for 
Gorilla, and 9 Myr ago for Pan are all probable estimates (see Sup- 
plementary Information). We consider that the early divergence 
hypothesis is congruent with both fossil and molecular data, and 
should be further evaluated using both sides of the evidence. 

At Chorora, previous systematic field research at the type site has 
yielded a ‘Hipparion’-dominated fauna, which was considered to 
indicate a significant representation of open habitats in possibly a 
mosaic environment”, an interpretation corroborated by recent iso- 
topic and wear studies of “Hipparion teeth*'. To the contrary, the 
Beticha locality is poorly fossiliferous, dominated by primates, and 
almost lacking in ‘Hipparion’ (Supplementary Information). The 
Beticha fossils occur at a higher stratigraphic level, within sediments 
formed by a braided river system with intermittent palaeosol for- 
mation (Supplementary Information). We hypothesize that this 
indicates a mosaic that includes a forested lake margin environment 
formed towards the end of sedimentation of the Chorora Formation 
palaeobasin, while episodic rifting was locally ceasing. Although the 
evidence is so far limited, this suggests that, in the changing landscape 
of the African middle to late Miocene, ape species were actually 
present in both what are now Kenya””® and Ethiopia. The available 
contextual evidence suggests that they predominantly occupied 
the more forested environments, and that the modern African 
ape and hominid clades probably emerged from a late Miocene 
African diversity of apes, which is only starting to be revealed. 
Palaeontologically, our results indicate that more focus is needed 
on comparatively poorly fossiliferous sediments that sample the 
habitats that were preferred by the Miocene apes. 
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Non-random coextinctions in phylogenetically 
structured mutualistic networks 


Enrico L. Rezende’, Jessica E. Lavabre’, Paulo R. Guimaraes Jr’, Pedro Jordano' & Jordi Bascompte’ 


The interactions between plants and their animal pollinators and 
seed dispersers have moulded much of Earth’s biodiversity’’. 
Recently, it has been shown that these mutually beneficial inter- 
actions form complex networks with a well-defined architecture 
that may contribute to biodiversity persistence*®. Little is known, 
however, about which ecological and evolutionary processes gen- 
erate these network patterns*’. Here we use phylogenetic meth- 
ods'*'' to show that the phylogenetic relationships of species 
predict the number of interactions they exhibit in more than one- 
third of the networks, and the identity of the species with which 
they interact in about half of the networks. As a consequence of the 
phylogenetic effects on interaction patterns, simulated extinction 
events tend to trigger coextinction cascades of related species. This 
results in a non-random pruning of the evolutionary tree” and a 
more pronounced loss of taxonomic diversity than expected in the 
absence of a phylogenetic signal. Our results emphasize how the 
simultaneous consideration of phylogenetic information and net- 
work architecture can contribute to our understanding of the 
structure and fate of species-rich communities. 

Plant and animal species establish mutually beneficial interactions 
such as pollination and seed dispersal that can form complex net- 
works of dependency. Recent work has characterized the architecture 
of mutualistic networks, with the ultimate goal of understanding 
their formation and maintenance and the coevolution of species 
within them. These networks are very heterogeneous (some species 
have a much larger number of interactions than expected by 
chance)*, and nested (specialists interact with proper subsets of the 
species that generalists interact with)°, and are built on weak and 
asymmetric links (for example, if a plant species depends strongly 
on an animal species, the dependence of the animal on that plant is 
much weaker)°. The next step on the road to understanding these 
networks is to disentangle the contribution of different evolutionary 
and ecological processes in generating their patterns*”"*. 

Here we ask to what extent network architecture is associated with 
species phylogenetic relationships (Fig. 1), and whether coextinction 
cascades following a species disappearance’ involve phylogenetically 
related (that is, non-randomly sampled) species. The presence of a 
phylogenetic signal, where patterns of interactions between species can 
be partly explained by phylogenetic relatedness, would suggest that net- 
work patterns are partially dependent on past evolutionary history, and 
so cannot be exclusively explained by current ecological processes'*”. 

We compiled the largest data set of plant—-animal mutualistic inter- 
actions, comprising 36 plant—pollinator and 23 plant—frugivore 
mutualistic networks spanning a broad geographic range (data sets 
are available as Supplementary Information). For each network, we 
reconstructed the phylogenies of the animals and plants (see Supple- 
mentary Methods). We then characterized two components of net- 
work architecture. First, we considered the number of interactions 


per species, that is, species degree*'®, and its quantitative extension, 
species strength. The strength of a plant species, for instance, is 
defined as the sum of dependences or interaction weights of the 
animal species on that plant®. These simple components of network 
architecture reflect the generalization level of a given species and its 
quantitative relevance in terms of how other species depend on it. 
Second, we considered the identity of each species’ interactors 
(Methods). Both the number of interactions per species, and the 
identity of the species with which they interact have been identified 
as major determinants of network architecture and robustness”'?”°, 
With the phylogenies and these components of network structure at 
hand, we applied phylogenetic statistical tools'*'' to characterize the 
extent to which closely related species tend to have similar patterns of 
interactions (Fig. 1). 
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Figure 1| A phylogenetic approach to mutualistic networks. We test to 
what extent the architecture of coevolutionary networks is associated with 
evolutionary history conveyed in the phylogenies of plants and animals. A 
plant (green circles) and an animal (red squares) are linked if the latter is a 
pollinator or a seed disperser of the former. Symbol size is proportional to its 
number of links. a-c, Examples where phylogeny accurately predicts the 
number of interactions (a), phylogeny does not predict the number of 
interactions (b), and the real correspondence in one network (c) (see NCOR 
in the Supplementary Methods). 


Integrative Ecology Group, Estacion Biolégica de Dofiana, CSIC, Apdo. 1056, E-41080 Sevilla, Spain. “Instituto de Fisica ‘Gleb Wataghin’, UNICAMP, 13083-970, Campinas, Sao Paulo, 


Brazil. 


925 


©2007 Nature Publishing Group 


LETTERS 


Randomization tests (see Methods) suggest a significant phylogen- 
etic signal in species degree in 24.8% of the data sets (26 of 105 
phylogenies, one-tailed P< 0.05; Fig. 2a). Power analyses indicate 
that these estimates are highly conservative, suggesting that phylo- 
genetic effects may be present in a larger proportion of the com- 
munities (Supplementary Figs 1 and 2). Also, the probability of 
detecting a signal in species degree seems to increase with phylogeny 
size (for example, 54.5% of the 22 phylogenies with more than 70 
species showed significant signal; Supplementary Fig. 1). The nega- 
tive association between the amount of phylogenetic signal estimated 
as Kand phylogeny size (Fig. 2a) is probably an artefact. Even though 
the expectation of Kis 1 under the null hypothesis of the true phylo- 
geny, its lower bound decreases with the number of species in the 
phylogeny. Alternatively, Ornstein—Uhlenbeck branch-length trans- 
formation methods'' (Methods) supported a significant signal in 
degree in 36.2% of the phylogenies where analyses converged (25 
out of 69). P-values from randomization and branch-length trans- 
formation tests were highly positively correlated (P< 0.001), indi- 
cating that these two tests provide similar results. Thus these results 
show that phylogenetically related species have a similar number of 
interactions per species in at least one out of four phylogenies en- 
compassing 39.0% of the networks. 

Conversely, significant signal in species strength is present in only 
2.6% of the phylogenies according to randomization tests (1 of 38; 
Fig. 2b), and in 20.8% of the phylogenies where branch-length trans- 
formation converged (5 of 24). The amount of phylogenetic signal for 
species strength was significantly lower than estimates for degree 
(paired t-test between log-transformed K values, 47 = 1.806, one- 
tailed P = 0.039; Fig. 2c). In spite of the significant positive correla- 
tion between degree and strength’, estimates of species strength may 
be subject to higher levels of uncertainty associated with proximate 
factors such as species abundance variability, changes associated with 
phenological sequences, and sampling errors that tend to decrease 
phylogenetic signal'!. This hypothesis could be tested by determining 
how estimates of species degree and strength vary over time in the 
same communities, the expectation being that species strength would 
show larger fluctuations than would degree. Alternatively, one could 
test whether the signal for strength increases after normalizing by 
species abundance. Nonetheless, our results suggest that species 
degree has stronger phylogenetic signal than strength. 
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Turning now to our second component of network architecture— 
which species interacts with which—we tested whether phylogenetic 
relatedness correlates with ecological similarity. The ecological sim- 
ilarity of any two species is defined as the number of species with 
which they both interact divided by the total number of species with 
which they interact (Methods). Phylogenetic and ecological distance 
matrices are positively and significantly correlated in 42.7% of the 
phylogenies (44 of 103, one-tailed Mantel test, P< 0.05). This means 
that phylogenetically related species tend to interact with a similar set 
of species. To determine whether this result is a consequence of the 
phylogenetic signal in degree reported above, we repeated these tests, 
controlling for differences in the number of interactions per species 
(partial Mantel test, see Methods). The results remained qualitatively 
similar. Partial Mantel correlations are significant in 46.6% of the 
phylogenies, supporting the idea that phylogeny is associated with 
the identity of the species’ interactors after controlling for degree. 

The association between phylogenetic resemblance and ecological 
similarity tends to be more common among animals: 60.8% of 
Mantel correlations between ecological and phylogenetic distance 
matrices were significant for animals, whereas 25.0% were significant 
for plants (Fig. 3). In addition, comparison of the Mantel coefficients 
Z for plants and animals indicated that animal phylogenies were 
more strongly associated with species interaction patterns than were 
plant phylogenies (paired t-test between log-transformed values, 
t43 = 3.218, one-tailed P= 0.001; Fig. 3). Further, these results are 
robust even when the number of animal and plant species is statist- 
ically controlled (analysis of covariance, ANCOVA, F100 = 10.16, 
P= 0.02). Results from partial Mantel tests controlling for degree 
were qualitatively similar (paired fttest, t4; = 2.576, one-tailed 
P= 0.014). It would be interesting to investigate multiple alternative 
hypotheses, such as differences in mobility and evolvability*’~’, to 
determine the cause of this difference. 

Although network structure seems significantly more associated 
with animal phylogenies, structure may be driven by the evolutionary 
history of both plants and animals (7 of 44 communities), only plants 
or only animals (3 and 21 communities, respectively), or neither 
plants nor animals (the remaining 13 communities). This highlights 
the large variability across networks, indicated by the residual vari- 
ation of K or Z after controlling for phylogeny size (Figs 2 and 3; 
Supplementary Table 1). Part of this variability is related to the taxo- 
nomic diversity of the plant lineages, though apparently not to the 
diversity of frugivores (fruit-eaters) or pollinators according to 
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Figure 2 | Magnitude of phylogenetic signal on the number and strength of 
mutualistic interactions. Relationship between the magnitude of 
phylogenetic signal K and phylogeny size, estimated for species degree 

(a), and strength (b). Each data point represents a phylogeny: green circles 
for plants and red squares for animals. Solid symbols indicate statistically 
significant phylogenetic signals. c, Comparison of phylogenetic signal for 
species degree and strength. Green and dashed red bars correspond to plants 
and animals, respectively. Estimates obtained for degree were significantly 
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Figure 3 | Correlation between ecological similarity and phylogenetic 
relatedness. Results of regular Mantel tests correlating phylogenetic and 
ecological distance matrices plotted against phylogeny size, obtained for 
plants (a) and animals (b). Each data point corresponds to a phylogeny, and 
a solid symbol indicates a statistically significant correlation. c, Comparison 
between Mantel Z estimates obtained separately for plants and animals 
composing each network. Communities where interaction patterns are more 
associated with animal phylogenies are depicted in red; those more 
associated with plants in green. The phylogenetic structure of animals 
correlates significantly better with interaction matrices than that of plants. 
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several regression models (Methods). One plausible explanation for 
this result is that flower or fruit morphology is variable across the 
taxonomic groups examined, whereas phenotypic variation for fru- 
givores and insects is more relevant at other taxonomic levels. 
Partitioning the phenotypic variance across taxonomic groups may 
address this hypothesis, and also clarify the mechanisms by which 
evolutionary history translates into patterns of interactions. 

We have shown that phylogenetically related species tend to have 
similar roles in the network of interactions in almost half of the 
communities studied here. These effects are not a reflection of differ- 
ences between major taxonomic groups only, because they also 
appear at finer scales of the phylogenies (Supplementary Fig. 3). In 
light of this, we tested whether simulated coextinctions’ involve taxo- 
nomically related species more often than expected by chance for 
cascades of identical size (Methods). This would result in a non- 
random pruning of the evolutionary tree’”’’. Simulations show that 
the rate of taxonomic diversity loss is higher than expected in the 
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Correlation between phylogeny and ecology 


Figure 4 | Phylogenetic resemblance induces a higher loss of taxonomic 
diversity after species extinctions. Taxonomic diversity of plants (a) and 
animals (b) of five communities as a function of the number of extinct 
species, removed from most specialized to most generalized. Each symbol 
represents the removal step leading to the next generalization level. 
Taxonomic diversity is the ratio between indexes from the real communities 
over randomization results removing phylogenetic effects. Full symbols 
indicate taxonomic losses below 95% of the null model. c, Average (+s.e.m.) 
rate of taxonomic loss of ten communities regressed against the magnitude 
of phylogenetic effects. 
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absence of phylogenetic signal (Fig. 4). Although these effects may 
seem quantitatively small (partly owing to the averaging nature of 
the index), they can actually encompass the extinction of entire fam- 
ilies or higher taxonomic groups from the community. The overall 
reduction in taxonomic diversity holds across communities, so that 
values falling below the null expectation are significantly more fre- 
quent than those above it (77 = 50.7, degrees of freedom, d.f. = 1, 
P<0.0001; Fig. 4). Moreover, the contribution of phylogeny to spe- 
cies patterns of interaction correlates with the magnitude of taxo- 
nomic diversity loss across communities (P< 0.05 for parametric 
and non-parametric correlations; Fig. 4c). Therefore, communities 
in which species interactions have a strong phylogenetic component 
are more prone to have closely related species going coextinct fol- 
lowing an extinction event. We conclude that the interaction between 
network and phylogenetic structures can ultimately result in non- 
random coextinction patterns. 

Plant—animal mutualisms form heterogeneous, nested networks 
built on weak and asymmetric links among species, which may facil- 
itate long-term species persistence®’. Our results provide evidence 
for the role of phylogenetic relationship as one determinant factor 
shaping these patterns: phylogeny partly accounts for species’ pro- 
pensities to interact in more than one-third of the networks, and the 
identity of the species with which they interact in about half of the 
networks. From a theoretical point of view, our results warrant the 
inclusion of evolutionary history into mechanistic models of network 
formation and maintenance™. From a conservation perspective, our 
results show that cascading effects of coextinction may spread across 
taxonomically related species, further increasing the erosion of taxo- 
nomic diversity. 


METHODS SUMMARY 


The amount of phylogenetic signal in degree and strength was quantified with 
the K statistic, which is the fraction of the amount of signal of the real data set 
over that expected, assuming brownian motion and the same tree topology. 
Significance of phylogenetic signal was tested with randomization and branch- 
length transformation methods'’. 

We used Mantel tests to compare phylogenetic distance matrices with mat- 
rices of ecological distance. Phylogenetic distance between pairs of plants (or 
animals) was estimated as the expected covariance of the trait between the two 
species''”°. Ecological distance was calculated as 1 — S, where S is the Jaccard 
index of similarity*®. We also performed partial Mantel tests controlling for the 
absolute difference in degree between two species. 

Species removal simulations started from the most specialized to the most 
generalized species’. After an extinction cascade, we calculated the community 
taxonomic diversity as the average taxonomic distance between species”’. Path 
length weights between species increased the more distantly related they were 
taxonomically (that is, species of the same genus have a distance of 1 whereas 
species from different genera within the same family have a distance of 2, and so 
on). Thus, the higher the index, the more diverse the community. To calculate 
the decrease of taxonomic diversity of the real community with respect to the 
expected decrease in the absence of phylogenetic signal, we replicated the coex- 
tinction cascade 1,000 times after randomizing the taxonomic affiliation of 
species going coextinct. The taxonomic diversity relative to the null expectation 
was the ratio between real and null values, and the significance was estimated by 
counting how often the real value fell below the randomization results. The 
average rate of taxonomic loss per community was calculated as the slope of a 
linear regression with an intercept forced through 1. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Database. We compiled 59 qualitative mutualistic networks (36 plant—pollinator 
and 23 plant—frugivore webs) describing the presence or absence of interactions. 
The data set encompasses Mediterranean, tropical, temperate, subtropical and 
Arctic communities from all continents except mainland Asia and Antarctica. Of 
these networks, 22 (9 for pollination and 13 for frugivory) are quantitative, 
describing the strength of each interaction or pairwise dependence (see data sets 
in Supplementary Information). From these networks, we calculated species 
degree and species strength for plants and animals separately, and recorded the 
taxonomic affiliation of species forming each community. 

Phylogenies. We assembled one animal and one plant phylogeny for each com- 
munity. Phylogenies are based primarily on molecular data, with a few species 
included according to taxonomic information (Supplementary Methods). 
Phylogenies with less than ten species or with too many unresolved nodes were 
not included in analyses. This resulted in 105 phylogenies for the following 
groups: 35 insect phylogenies (Class Insecta; all pollinators), 18 bird phylogenies 
(Class Aves; all frugivores), and 52 angiosperm phylogenies (Infraphylum 
Angiospermae; 33 belonging to plant—pollinator and 19 to plant—frugivore 
networks). 

Phylogenetic statistical methods. We tested for the presence of phylogenetic 
signal on species degree and strength with randomization and branch-length- 
transformation tests''. These methods test whether species attributes are signifi- 
cantly associated with phylogeny, using randomization or maximum-likelihood 
procedures. Branch-length transformation tests were performed assuming the 
Ornstein—Uhlenbeck model of stabilizing selection and a model in which char- 
acter evolution can accelerate or decelerate (ACDC)'’. Because ACDC models 
did not converge in most cases, here we discuss results from the Ornstein— 
Uhlenbeck model (all analyses are included in Supplementary Material for 
completeness). Although these techniques provide similar information about 
the presence of a phylogenetic signal, applying both methods can be useful in 
determining how robust our results are and in overcoming limitations inherent 
to each statistical test (Supplementary Figs 1 and 2). The amount of phylogenetic 
signal was quantified with the K statistic (Fig. 2), which is the fraction of the 
amount of signal of the real data set over that expected, assuming brownian 
motion and the same tree topology. 

We used Mantel tests to compare phylogenetic distance matrices with mat- 
rices of ecological distances between species. Phylogenetic distance between pairs 
of plants (or animals) was estimated as the expected covariance of the trait 
between the two species''”’. Ecological distance was calculated as 1 — S, where 
S is the Jaccard index of similarity obtained from qualitative interaction mat- 
rices*, The similarity between two species i and j is defined as S(i,j) = 
al(a+b+c), where a, b and c represent the number of shared interacting 
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species, the number of interactions specific to species i, and the number of 
interactions exclusive to species j, respectively. 

Because differences in degree affect Jaccard estimates, we also performed 
partial Mantel tests controlling for degree (the pairwise distance in degree was 
calculated as the absolute difference in degree between two species; Supplemen- 
tary Methods). Hence, this partial test can discern whether phylogeny strictly 
affects the species with which species interact, independently of the total number 
of interactions of each species. When necessary, estimates were log-transformed 
to improve normality (or log-value + 1 for statistics varying between —1 and 1, 
as Mantel’s Z). 

According to regression models controlling for phylogeny size, community 

size, and number of interactions, phylogenetic signal was similar for frugivory 
and pollination networks (P > 0.28 for Kand for Mantel’s regular and partial Z), 
hence results were pooled. 
Taxonomic diversity and coextinction simulations. As a surrogate for phylo- 
genetic diversity, we estimated taxonomic diversity of plants and animals in the 
largest available phylogenies (23 plant and 27 pollinator phylogenies with more 
than 30 species, and 15 bird phylogenies with more than 15 species; see 
Supplementary Methods). The mean taxonomic distance between all species 
was employed as an index of taxonomic diversity”’ in subsequent regressions. 

Extinction cascades were simulated for the ten largest communities (all having 
more than 40 animal and plant species) with available taxonomic affiliation, 
following ref. 7. After one species is removed, species left without any interaction 
go coextinct. Species removal started from the most specialized (least-linked) to 
the most generalized (most-linked) species, which was proposed as a more 
plausible extinction sequence because specialist species tend to be less abundant 
than generalists*’*. In spite of several assumptions implicit in the model (for 
example, all plants require animals for reproduction, and species cannot adapt to 
new resources), this approach provides the first reasonable attempt to study 
coextinction patterns in phylogenetically structured networks’. Although 
these assumptions may affect the total number of species going extinct’, our 
comparisons involve coextinction cascades of the same size, with and without 
phylogenetic signal. 

After an extinction cascade, we calculated the decrease of taxonomic diversity 
of the real community respect to the expected decrease in the absence of phylo- 
genetic signal. This was done by replicating the coextinction cascade after ran- 
domizing the taxonomic affiliation of species going coextinct (that is, nodes 
remain unchanged but their ‘name tags’ are shuffled). This null model removes 
effects of phylogenetic relatedness" controlling for network structure and spe- 
cies number. The relative taxonomic diversity is the ratio between real and null 
values, and the average rate of taxonomic loss per community is the slope of a 
linear regression with an intercept forced through 1 (that is, real values and the 
null expectation are equal when no species are removed). 
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Regulation of IgA production by naturally occurring 
TNF/iNOS-producing dendritic cells 


Hiroyuki Tezuka’, Yukiko Abe’, Makoto Iwata’, Hajime Takeuchi’, Hiromichi Ishikawa’, Masayuki Matsushita’, 


Tetsuo Shiohara’, Shizuo Akira® & Toshiaki Ohteki' 


Immunoglobulin-A has an irreplaceable role in the mucosal 
defence against infectious microbes’*. In human and mouse, 
IgA-producing plasma cells comprise ~20% of total plasma cells 
of peripheral lymphoid tissues, whereas more than 80% of plasma 
cells produce IgA in mucosa-associated lymphoid tissues 
(MALT)*®. One of the most biologically important and long- 
standing questions in immunology is why this ‘biased’ IgA 
synthesis takes place in the MALT but not other lymphoid organs. 
Here we show that IgA class-switch recombination (CSR) is 
impaired in inducible-nitric-oxide-synthase-deficient (INOS '~; 
gene also called Nos2) mice. iNOS regulates the T-cell-dependent 
IgA CSR through expression of transforming growth factor-p 
receptor, and the T-cell-independent IgA CSR through production 
of a proliferation-inducing ligand (APRIL, also called Tnfsf13) and 
a B-cell-activating factor of the tumour necrosis factor (TNF) 
family (BAFF, also called Tnfsf13b). Notably, iNOS is preferen- 
tially expressed in MALT dendritic cells in response to the recog- 
nition of commensal bacteria by toll-like receptor. Furthermore, 
adoptive transfer of iNOS* dendritic cells rescues IgA production 
in iNOS~‘~ mice. Further analysis revealed that the MALT dend- 
ritic cells are a TNF-a/iNOS-producing dendritic-cell subset, ori- 
ginally identified in mice infected with Listeria monocytogenes’*. 
The presence of a naturally occurring TNF-a/iNOS-producing 
dendritic-cell subset may explain the predominance of IgA pro- 
duction in the MALT, critical for gut homeostasis. 

IgA is the most abundant immunoglobulin in the body and has a 
critical role in mucosal immunity’. Secretory IgA is constitutively 
released into the lumen via the transport system of the intestinal 
epithelia’; it prevents invading pathogens from binding to mucosal 
epithelial cells and neutralizes their toxins**. Natural secretory IgA is 
induced by constant antigenic stimulation by the intestinal com- 
mensal bacteria, and dendritic cells have an important function in 
this process”'®. IgA-specific CSR occurs in the constant region of the 
immunoglobulin heavy chain (Cy), from Cpt to Ca, in naive B cells, 
and is accomplished by T-cell-dependent and T-cell-independent 
CSR°*. T-cell-dependent CSR is induced by TGF-B1 and CD40L 
expressed on activated T cells. On the other hand, two dendritic- 
cell-derived cytokines, APRIL and BAFF, directly mediate T-cell- 
independent CSR’"”. Notably, the IgA CSR mainly takes place in 
the MALT, particularly in Peyer’s patches and lamina propria, then 
IgA* B cells re-circulate through other lymphoid organs and the 
bloodstream*'*"*. In this context, one of the more enigmatic ques- 
tions is why IgA CSR and IgA secretion take place largely in the 
MALT but not other lymphoid organs. 

iNOS is upregulated by bacterial components and modulates 
immune responses’. However, little is known about the role of 


iNOS in immunoglobulin synthesis. Under steady-state conditions, 
the serum IgA and IgG2b levels in iNOS '~ mice were significantly 
lower compared with wild-type mice, whereas the serum levels of 
other immunoglobulin subclasses were unaffected in iNOS '~ mice 
(Fig. la). iNOS ‘~ mice also showed impaired secretory IgA produc- 
tion in both intestinal contents and faecal pellets (Fig. 1a). Notably, 
the impaired IgA production strictly correlates with the lack of the 
iNOS gene even in F, ((wild type X iNOS ‘~) F, X iNOS~'~) off- 
spring (Supplementary Fig. 1a), and the level of IgA in the sera and 
intestinal contents of wild-type mice treated with selective iNOS 
inhibitors (aminoguanidine (AMG) and L-N° -(1-iminoethyl)-lysine 
(L-NIL)) and a NO scavenger (carboxy-PTIO (C-PTIO)) was signifi- 
cantly lower than in untreated wild-type mice (Fig. 1b). In contrast, 
the number of intestinal T-cell-receptor (TCR)-y5* cells and the 
expression level of polymeric immunoglobulin receptor (pIgR) mes- 
senger RNA in intestinal epithelial cells—critical regulators for the 
transportation of secretory IgA into the lumen'®'’—were unaffected 
in iNOS '~ mice (Supplementary Fig. 1b, c). As expected from these 
observations, the number of post-CSR IgA* B cells (IgA *B220*) and 
IgA* plasma cells (IgA* B2207 ), and the in vitro production of IgA of 
these cells in the absence or presence of interleukin (IL)-5, were 
significantly reduced in iNOS '~ mice (Fig. lc, f and Supplemen- 
tary Fig. 1d) and wild-type mice treated with AMG, L-NIL and 
C-PTIO (Fig. 1d, e). This effect was probably not due to the impaired 
trafficking and survival of IgA* B cells (Supplementary Fig. le), and 
the expression level of IL-5 receptor and IL-5-dependent prolifera- 
tion were unaffected in B cells of iNOS '~ mice (Supplementary Fig. 
1f). Notably, impaired ovalbumin (OVA)-specific IgA response was 
also apparent in iNOS '~ mice (Fig. 1g). Thus, iNOS-derived NO is 
required for IgA synthesis in vivo. 

IgA CSR is mediated by both T-cell-dependent and T-cell- 
independent pathways’. To test whether the T-cell-dependent IgA 
CSR is impaired in iNOS ‘~ B cells, naive IgD* B cells of wild-type 
and iNOS ‘~ mice were stimulated with TGF-B1 and anti-CD40 
monoclonal antibody, an inducer for IgA CSR°. Additional stimu- 
lation, such as anti-CD40 plus interferon-y and anti-CD40 plus IL-4, 
was also included to induce IgG2c and IgG1/IgE CSR, respectively. 
Not only immunoglobulin production but also the expression of 
activation-induced cytidine deaminase (AID, an essential enzyme 
for CSR") and o-germinal transcript (o-GT, a prerequisite for Cu 
to Ca CSR) were impaired in iNOS‘ B cells under IgA-CSR- 
inducing conditions (Fig. 2a—c), indicating that the IgA CSR is 
impaired in iNOS ‘~ B cells. The TGF-f receptor, an essential com- 
ponent of T-cell-dependent IgA CSR, is composed of TGF-BRI and 
TGF-BRH'”°. Of note, the expression of TGF-BRII, but not TGF- 
BRI, in naive B cells prepared from the indicated organs of iNOS '~ 
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mice was much lower compared with that of wild-type mice (Fig. 
2d, e). Thus, the TGF-B1- and anti-CD40-mediated induction of 
downstream molecules of the TGF-f receptor, such as Smad3 
(ref. 21) and Runx3 (ref. 22), was decreased in iNOS '~ B cells 
(Fig. 2f). In contrast, neither CD40 expression nor CD40-dependent 
proliferation was affected in these cells (Supplementary Fig. 2a). 
Notably, the B-cell expression of TGF-BRII was restored when 
naive iNOS~‘~ B cells were cultured with NONOate, a NO donor, 
in a phosphatidylinositol-3 kinase/AKT-dependent manner in vitro 
(Supplementary Fig. 2b) and transferred into wMt (known as Igh-6) 
mice (Fig. 2g). The latter was coincident with the restoration of 
serum IgA level (Fig. 2h), indicating that iNOS '~ B cells retain 
the ability to secrete IgA in iNOS*'* host. These results demonstrate 
that the critical role for iNOS in the T-cell-dependent IgA CSR is to 
regulate the B-cell expression of TGF-BRIL 

To examine further whether iNOS~'~ T cells are functional in 
T-cell-dependent IgA production in vivo, T cells isolated from 
wild-type and iNOS ‘~ mice were transferred into T-cell-deficient 
Tcrb ’ Tcrd ‘~ mice, in which systemic and secretory IgA produc- 
tion was impaired owing to the lack of T-cell-dependent CSR 
(Supplementary Fig. 3a)’®. With time, the serum and intestinal con- 
tent IgA production was restored in Tcrb-’ Tcrd ‘~ mice that 
received either wild-type or iNOS ‘~ T cells, at a comparable 
level (Supplementary Fig. 3b). Furthermore, in the presence of 
TGF-B1, naive wild-type B cells co-cultured with activated 
iNOS ‘~ T cells produced IgA at a similar level to those co-cultured 
with activated wild-type T cells (Supplementary Fig. 3c). These 
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results indicate that iNOS ‘~ 
dependent IgA synthesis. 

We next examined T-cell-independent IgA production, which is 
mediated by dendritic-cell-derived cytokines APRIL and BAFF, as 
well as B-1 cells'®’*. Of note, iNOS was co-expressed with APRIL 
and BAFF in Peyer’s patches and lamina propria dendritic cells of 
wild-type mice, whereas the expression level of these cytokines was 
decreased in the equivalent dendritic cells of iNOS '~ mice (Fig. 3a, b 
and Supplementary Fig. 4a). In this context, on CpG stimulation, 
expression of APRIL and BAFF was elevated in wild-type dendritic 
cells but not in iNOS~'~ dendritic cells nor wild-type dendritic cells 
treated with L-NIL (Supplementary Fig. 4b). In contrast, expression 
of transmembrane activator and CAML interactor (TACT), a receptor 
for APRIL and BAFF, was unaffected in iNOS '~ B cells (Supplemen- 
tary Fig. 4c). As expected from these findings, IgA production from 
iNOS ‘~ B cells was impaired when co-cultured with Peyer’s patches 
and lamina propria dendritic cells of iNOS~‘~ mice, whereas it was 
significantly restored when co-cultured with the equivalent dendritic 
cells of wild-type mice or those of iNOS~'~ mice in the presence of 
APRIL (Fig. 3c, d). The T-cell-independent IgA production was 
efficiently blocked in the presence of BCMA- and TACI-Ig 
(Supplementary Fig. 4d). In this context, iNOS ‘~ dendritic-cell 
expression of APRIL was substantially restored when cultured with 
NONOate in a NFAT/NF-«B-dependent manner, whereas express- 
ion of TGE-BRII by iNOS ‘~ B cells remained impaired even after 
APRIL stimulation (Supplementary Fig. 4e, f). In addition, the level 
of APRIL-induced IgA synthesis was unaffected in the presence of 
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Figure 1| IgA reduction in iNOS~/~ mice. a, b, Samples from the sera, 
intestinal content (IC) and faecal pellets (FP) of control wild-type and 
iNOS ‘~ mice (a), and wild-type mice given 2.5% (w/v) AMG for 7 days, 
4.5mM 1-NIL for 14 days, and 3 mg day _' C-PTIO for 14 days (b), were 
tested for immunoglobulin levels by ELISA. Each group includes three to 
four mice and data are representative of three experiments. c,d, Mononuclear 
cells from the spleen (Spl.), mesenteric lymph node (MLN), Peyer’s patches 
(PP) and lamina propria (LP) cells of indicated mice were stained for IgA and 
B220. Numbers in each rectangle indicate the proportion of the cells. Data 
are representative of three experiments. e, Lamina propria cells of AMG-, 
L-NIL- and C-PTIO-treated wild-type mice were cultured for 3 days. f, B cells 


930 


from the spleen, mesenteric lymph node and Peyer’s patches, and total LP 
cells, of indicated mice were cultured in the absence or presence of 

100 ng ml" IL-5 for 7 days. Immunoglobulin level in the culture medium is 
shown in e and f. Data in e, f, show mean + s.d. of triplicate cultures and are 
representative of five experiments. g, Mice were immunized with 100 pg 
OVA, boosted with 50 ug OVA 21 days after immunization, and OVA- 
specific immunoglobulin levels were determined 10 days later by ELISA. 
Each group includes three mice and data are representative of two 
experiments. Asterisks in relevant panels indicate unpaired Student’s t-test; 
P value is <0.05. Error bars represent s.d. 
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Figure 2 | Impaired T-cell-dependent IgA CSR in iNOS ’~ mice. Naive IgD~* 
B cells isolated from the spleen of wild-type and iNOS ‘~ mice were cultured 
with the indicated stimuli for 7 days (a) or 3 days (b, ¢, f). The cytokines and 
monoclonal antibody used in this experiment were: 1 ngml_' TGF-B1, 

100 ng ml’ interferon-y, 100 ng ml * IL-4, 2 1g ml’ anti-CD40 monoclonal 
antibody. a, IgA production by stimulated B cells was determined by ELISA. 
Data show mean = s.d. of triplicate cultures. b, ¢, Aid (also called Aicda) and 
a-GT (an intermediate germinal transcript for Ci to Cx CSR) mRNA 
expression in stimulated B cells was examined by RT-PCR. d, e, TGF-BRI and 
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Figure 3 | Impaired T-cell-independent CSR in INOS’ mice. a, b, Peyer’s 
patches (a) and lamina propria (b) dendritic cells of wild-type and iNOS ‘~ 
mice were examined for April and Baff expression by real-time PCR. Data 
show mean + s.d. of three samples. c, d, Splenic IgD* B cells of wild-type and 
iNOS ‘~ (KO) mice were cultured with Peyer’s patches (c) and lamina 
propria (d) dendritic cells (DCs) of wild-type and iNOS '~ (KO) mice in the 
presence or absence of 100ngml~' APRIL for 7 days, and the IgA level was 
evaluated by ELISA. Data show mean = s.d. of triplicate cultures and are 
representative of five experiments. Asterisks in relevant panels indicate 
unpaired Student’s t-test; P value is <0.05. 


TGE-BRII expression levels (Tgfbr1 and Tgfbr2, respectively) in naive B cells of 
the indicated tissues were evaluated by RT-PCR (d) and real-time PCR 

(e). f, Expression of Smad3 and Runx3 mRNA in stimulated B cells was 
assessed by real-time PCR. g, h, Wild-type and iNOS '~ naive IgD" B cells 
were transferred into uMt / ~ mice. Three weeks after the transfer, naive B cells 
were tested for TGF-BRII expression by real-time PCR (g) and serum IgA level 
was determined by ELISA (h). NS, not significant. Data are representative of 
five experiments (b—d) or show mean = s.d. of three samples (e—h). Asterisks 
in relevant panels indicate unpaired Student’s t-test; P value is <0.05. 


anti-TGF-B neutralizing antibody or TGF-B1 (Supplementary Fig. 
4g), showing that APRIL regulates IgA synthesis independently of 
TGF-B/TGF-BR in mice. Furthermore, the number and in vitro IgA 
production of B-1 cells were reduced in iNOS~’~ mice (Supple- 
mentary Fig. 4h, i). 

These results indicate that iNOS might be expressed in the dend- 
ritic cells residing in the sites where IgA CSR predominates in vivo. 
Notably, iNOS was preferentially expressed in CD11c’° dendritic cells 
in the MALT. The number of iNOS* dendritic cells was significantly 
higher in the lamina propria than in the Peyer’s patches and mes- 
enteric lymph node, and negligible in the spleen, of wild-type mice 
(Fig. 4a), which correlates well with the IgA production in each site 
(Fig. 1f). To identify the induction mechanism of iNOS* dendritic 
cells in the MALT under steady-state conditions, we further analysed 
Myd88 ‘~, germ-free and Tlr2-'~ Tira‘ TIr9”'~ mice. The num- 
ber of iNOS* dendritic cells was much reduced in the MALT of 
these mice (Fig. 4a and Supplementary Fig. 5a), suggesting that this 
dendritic cell subset is induced by TLR-dependent recognition of 
commensal bacteria. The lamina propria dendritic cells also 
expressed gaseous NO, suggesting that NO is produced by iNOS* 
dendritic cells (Fig. 4b). The iNOS* dendritic cells were TNE-o.* 
CD11b*MHC class II*CD80*CD86"°Ly6C*Ly6G" Gr-1*Mac3*, 
revealing that they are presumably a naturally occurring TNF-o/ 
iNOS-producing dendritic cell subset (Fig. 4c and Supplementary 
Fig. 5b)’*. The absence of iNOS* dendritic cells in iNOS’, 
Myd88"'~, germ-free and Tir2-'~ Tlr4‘~ TIr9”'~ mice correlates 
well with the impaired IgA production in the sera and intestinal 
contents of these mice (Fig. 4d and Supplementary Fig. 5c). 
Furthermore, IgA production, B-cell expression of TGF-BRII, and 
the numbers of IgA~ B cells and IgA-producing plasma cells were 
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restored in iNOS'~ mice when transferred with a non-T/B-cell 
fraction of iNOS*'* lamina propria cells and iNOS*'* lamina pro- 
pria dendritic cells (Fig. 4e, fand Supplementary Fig. 5d). In contrast, 
these factors (IgA production, B-cell expression of TGF-BRIL, and the 
numbers of IgA* B cells and IgA-producing plasma cells) remained 
low in iNOS ‘~ mice when transferred with a dendritic-cell-depleted 
non-T/B-cell fraction of iNOS‘’* lamina propria cells, a non-T/B- 
cell fraction of iNOS ~‘~ lamina propria cells, iNOS‘~ lamina pro- 
pria dendritic cells and iNOS*’* spleen dendritic cells (Fig. 4e). 
Collectively, these results demonstrate that TNF-«/iNOS-producing 
dendritic cells are essential for IgA production in vivo. 

Our results show that IgA production is selectively reduced in 
iNOS '~ mice. iNOS controls B-cell expression of TGF-BRII in 
T-cell-dependent IgA CSR, as well as dendritic-cell-derived APRIL 
and BAFF, and B-1-cell-derived IgA production in T-cell-independent 
IgA CSR. Consistent with these observations, mice lacking TGF- 
BRI in B cells show a selective reduction of IgA in the sera and 
external secretions'’””°, and the serum IgA level and IgA response 
to mucosal antigen exposure are reduced in April '~ mice? and 
Taci ‘~ mice”, With respect to the molecular mechanism, NO reg- 
ulates expression of TGF-BRII on B cells in a phosphatidylinositol-3 
kinase/AKT-dependent manner, whereas it influences APRIL secre- 
tion by dendritic cells through NFAT- and NF-«B-dependent 
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Figure 4 | Identification of iINOS* dendritic cells in the MALT. 

a, CD3e B220° cells from the indicated tissues of C57BL/6, B6.iNOS '~, 
B6.Myd88 ', BALB/c and BALB/c germ-free mice were stained for CD11c 
and intracellular iNOS. b, NO detection in lamina propria dendritic cells by 
diaminofluorescein (DAF) staining. Original magnification, < 400. 

c, Characterization of lamina propria iNOS* dendritic cells. Lamina propria 
iNOS* dendritic cells were stained for the indicated surface markers. Solid 
line, each marker staining; dashed line, isotype control staining. d, IgA level 
in the sera and intestinal content of C57BL/6 (n = 5), B6.iNOS ' (n=5), 
B6.Myd88 ‘~ (n = 8), BALB/c (n = 5) and BALB/c germ-free (n = 12) mice 
was measured by ELISA. e, IgA level in the sera and intestinal content of 
iNOS ‘~ mice transferred with a non-T/B-cell fraction of iNOS*!* lamina 
propria cells (LP), a dendritic-cell-depleted non-T/B-cell fraction of 
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pathways. Supporting our results, NF-«B is critical for BAFF induc- 
tion”. In addition, we also found that the naturally occurring TNF-c/ 
iNOS-producing dendritic cells comprised a portion of retinal 
dehydrogenase (RALDH™) dendritic cells in the lamina propria 
(Supplementary Fig. 6a—c), which imprint gut-homing specificity 
on T and B cells By inducing the integrin «#487 (refs 26, 27); the 
number of «487* T and B cells was unaffected in TNF-«/iNOS- 
producing dendritic-cell-deficient iNOS”'~ mice (Supplementary 
Fig. 6d, e). Accordingly, we propose that TNF-o«/iNOS-producing 
dendritic cells directly govern IgA class-switching of naive B cells, 
whereas RALDH™ dendritic cells imprint gut-homing specificity of 
the post-CSR IgA” B cells. 

Impaired IgA production and intestinal epithelial cell homeostasis 
were observed in germ-free mice under steady-state conditions'®”’, 
and dendritic cells that reside between the intestinal epithelial cells 
sample commensal bacteria at the epithelial surface and participate in 
IgA induction in the MALT°”**”. Notably, we observed that iNOS is 
expressed in a subset of MALT dendritic cells, this expression is 
affected by toll-like-receptor-dependent recognition of commensal 
bacteria, and it has a critical role in IgA production. Our findings 
shed light on the important question of why IgA production, which is 
critical for the maintenance of gut homeostasis, takes place pre- 
dominantly in the MALT. 
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(LP DCs), iNOS ~/~ lamina propria dendritic cells (iNOS~‘~ LP DCs), 
iNOS*'* spleen dendritic cells (Spl. DCs), and iNOS ‘~ spleen dendritic 
cells (iNOS ‘~ Spl. DCs). Each group includes three mice. Two weeks after 
the transfer, the IgA level was determined by ELISA. f, TGF-BRII expression 
in naive B cells of C57BL/6, iNOS ‘~, iNOS ‘~ mice transferred with wild- 
type lamina propria dendritic cells (WT LP DCs) and iNOS ‘~ lamina 
propria dendritic cells (NOS '~ LP DCs). Two weeks after the transfer, the 
TGF-BRII expression was evaluated by real-time PCR. Data are 
representative of three experiments. Asterisks in relevant panels indicate 
unpaired Student’s t-test; P value is <0.05. Error bars represent s.d. 
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METHODS SUMMARY 

Mice. C57BL/6 mice (B6 mice) and BALB/c mice were obtained from Clea. 
B6.iNOS ‘~ mice were obtained from Taconic Farms, Inc. B6.Rag2! ~ mice 
were provided by Central Laboratories for Experimental Animals. F, ((C57BL/ 
6 X iNOS ‘~) F, X iNOS '~) offspring were made by crossing C57BL/6 mice 
and iNOS ‘~ mice. B6.iNOS ‘~ Rag? ‘~ mice were generated by crossing 
B6.iNOS '~ mice and B6.Rag2' mice. BALB/c germ-free mice were 
obtained from Yakult Central Institute for Microbiological Research. 
B6.TIr2 ‘ Tlr4 ’ Tlr9’— mice were made by crossing each Tir ‘~ mouse. 
All mice except for the germ-free mice were maintained in our SPF animal facility 
using the Micro-VENT system (Allentown Caging Equipment Company), and 
all animal experiments were done with the approval of the Institutional Animal 
Care Committee of the Akita University. 

Reagents and in vivo treatment. To examine the requirement of iNOS and NO 
for IgA production, wild-type mice were given 2.5% (w/v) aminoguanidine 
(Calbiochem), 4.5 mM L-N°-( 1-iminoethyl)-lysine (Wako), and 3 mg day 1 car- 
boxy-PTIO — (2-(4-carboxyphenyl)-4,4,5,5-tetramethylimidazoline-1-oxyl-3- 
oxide) (Dojindo) intraperitoneally and in drinking water for 7-14 days. 
Immunization. Mice were immunized intraperitoneally with 100 j.g alum-pre- 
cipitated ovalbumin (Sigma) (OVA/alum), boosted intraperitoneally with 50 pg 
OVA/alum 21 days later, and samples from the sera were collected 10 days after 
booster for estimation of anti-OVA antibodies. 

Measurement of immunoglobulin levels. Samples were collected from sera, 
intestinal content, faecal pellets and culture supernatants. The levels of IgM, 
IgG, IgGl, IgG2b, IgG2c and IgG3 were determined using the mouse ELISA 
quantification kit (Bethyl). 

PCR. For semi-quantitative RT-PCR analysis, serial dilutions of the cDNA 
templates were amplified using specific primers on a DNA thermal cycler (MJ 
Research). Primers for pIlgR, AID, «-GT, TGF-BRI, TGF-BRII, iNOS and 
GAPDH were obtained from Hokkaido System Science Co. Real-time PCR 
analysis was performed using LightCycler (Roche) to measure SYBR green 
(Roche) incorporation. Primers for TGF-BRII, Smad3, Runx3, APRIL, BAFF 
and f-actin were obtained from Nihon gene research Laboratories Inc. 
Relative expression of mRNA was normalized to B-actin mRNA levels within 
each sample. 

Statistical analysis. The statistical significance of the obtained values was eval- 
uated by Student’s t-test. A P value <0.05 was considered significant. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Mice. B6.Terb ’ Tcrd /—*, B6.Myd88 /~* and B6.uMr* mice were as 
described. 

Measurement of immunoglobulin levels. The level of IgA was measured by 
enzyme-linked immunosorbent assay (ELISA) as described previously*'. 

Cell preparation. To prepare naive B cells, spleen cells were labelled with PE- 
conjugated anti-mouse IgD mAb (eBioscience) and anti-PE MicroBeads 
(MiltenyiBiotec). IgD* B cells were isolated using AutoMACS (MiltenyiBiotec) 
and used as naive B cells (>97% IgD*). Likewise, B220* cells isolated from 
spleen, MLN, and PP were used as conventional B cells (>98% B220*). To 
prepare B-1 cells, peritoneal cells were labelled with PE-BMB8 (anti-F4/80), PE- 
B3Bé4 (anti-CD23), and Thy1.2 (anti-CD90.2) MicroBeads and subsequently with 
anti-PE MicroBeads. The negative fraction was used as B-1 cells. To prepare BM 
DCs, BM cells were cultured in the presence of 10ngml | recombinant mouse 
GM-CSF (R&D Systems) for 6 days by replacing medium every 3 days. After 
6 days of culture, BM cells were further stimulated with 20 ng ml ' recombinant 
mouse TNF-o for 2 days (R&D Systems). The cells were then labelled with N418- 
MicroBeads (anti-mouse CDlIlc) (MiltenyiBiotec) and isolated using 
AutoMACS. 

To prepare the intestinal intraepithelial lymphocytes (iIELs), PPs, fatty tissues 
and mesentery were removed from the small intestine. The small intestine was 
opened longitudinally and cut into 5-mm pieces. The pieces were washed gently 
and shaken for 45min at 37°C in RPMI 1640 medium containing 5% FCS, 
100U ml ' penicillin G, and 100 pg ml”! streptomycin. Whole-intestinal-cell 
suspensions, which include iECs and ilELs, were applied to discontinuous 
Percoll density gradients (Amersham Bioscience) of 25%, 40% and 75%, and 
cells at the interface between 40% and 75% were collected and used as IELs. After 
removal of iECs and ilELs, the remaining tissue was further digested with col- 
lagenase (type IV, Sigma), and cells at the interface between 30% and 60% of 
Percoll gradient were collected and used as LP cells. To purify DCs, collagenase 
digested spleen, MLN, PP and LP cells were labelled with N418-MicroBeads 
(anti-mouse CD11c) (MiltenyiBiotec) and isolated using AutoMACS. These 
purified CD11c* cells (>95% CD11c* I-A*) were used as BM, spleen, MLN, 
PP and LP DCs. 

Cell culture. To examine Ig secretion, B cells were cultured with anti-CD40 mAb 
Q ug ml |; eBioscience), LPS (2 ug ml |, Sigma), TGF-B1 (1 ngml |, RDI), 
mouse IL-5 (100 ngml', R&D Systems), mouse IL-15 (100 ng ml ', RDI, 
mouse APRIL (100ngml~', RDI), and anti-TGFB1 mAb (10ugml-', R&D 
Systems), and the culture supernatants were collected for ELISA. A cell prolif- 
eration assay was also performed in some experiments, in which the cells were 
pulsed with 1 Ci per well *H-thymidine (PerkinElmer Life Science) for the 
last 16h of a 72-h culture, and *H-thymidine incorporation was measured by 
a B-counter (Packard). In some experiments, 1 X 10°T cells that had been 
activated with anti-CD3e mAb (1 ug ml ') and anti-CD28 mAb (1 lg ml ') 
(eBioscience), were co-cultured with 5 X 10° splenic naive B cells in the presence 
of TGF-B1 for 7 days. Likewise, 5 X 10° B cells were co-cultured with 1.25— 
2.5 X 10° DCs in the presence or absence of APRIL, TACI-Ig (3.3p1gml ', 
R&D Systems) or BCMA-Ig (3.3 pgml_', R&D Systems) for 7 days. Splenic 
IgD* B cells were cultured with LY294002 (PI3 kinase inhibitor, Calbiochem) 
or Akt inhibitor (Calbiochem) for 1h before addition of 100 4M spermine 
NONOate (Alexis Biochemicals) for 18h. LP DCs were cultured with 5 uM 
BAY11-7082 (NF-«B inhibitor, Calbiochem), 5 uM 11R-VIVIT (NFAT inhibitor 
peptide, RRRRRRRRRRR-GGG-MAGPVIVITGPHEE, Sigma Genosys)**, 5 uM 
11R-VEET (control peptide, RRRRRRRRRRR-GGG-MAGPPHIVEETGPHVI, 
Sigma Genosys)**, or 500}1M L-NIL for 1h before the addition of 100 1M 
spermine NONOate or 1 uM CpG-A (5’-TCCATGACGTTCCTGATGCT-3’, 
Hokkaido System Science) for 18h. 

Antibodies and flow-cytometric analysis. Cells were stained with the following 
mAbs: fluorescein isothiocyanate (FITC)-N418 (anti-CD1lc), phycoerythrin 
(PE)-RA3-B2 (anti-B220), PE-53-6.7 (anti-CD8«), PE-B3B4 (anti-CD23), PE- 
BM8 (anti-F4/80), biotin-UC7-13D5 (anti-TCRy8), biotin-1C10 (anti-CD40), 
biotin-N418, biotin-M1/70 (anti-CD11b), biotin-16-10A1 (anti-CD80), biotin- 
GL1 (anti-CD86), biotin-RB6-8C5 (anti-Gr-1), biotin-53-6.7 (anti-CD8«), 
biotin-MB19-1 (anti-CD19), biotin-RA3-B2, biotin-53-2.1 (anti-Thy1.2), 
biotin-2E7 (anti-CD103), biotin-DX5 (anti-DX5), biotin-NLDC-145 (anti- 
CD205), and biotin-4B12 (anti-CCR7). These mAbs were purchased from 
eBioscience. FITC-C10-3 (anti-IgA), FITC-1A8 (anti-Ly6G), and biotin-AL-21 
(anti-Ly6C) were purchased from BD PharMingen. Biotin-H-7 (anti-IL-5Ra) 
was provided by T. Kouro and K. Takatsu of Tokyo University. Biotinylated 
mAbs were detected with streptavidin-PE-Cy5 (eBioscience). For intracellular 
iNOS and Mac-3 staining, CD3e B220° cells of the spleen, MLN, PP and LP 
were stained with biotin N418 and streptavidin-PE-Cy5, and were fixed and 


nature 


permeabilized with Cytofix/Cytoperm solution (BD Pharmingen). The cells 
were then stained intracellularly with PE-N20 (anti-iNOS, Santa Cruz) and 
FITC-M3/84 (anti-Mac-3) (eBioscience). For intracellular TNF- staining, 
CD3e B220~ cells of the LP were cultured with 50ng ml! PMA (Wako Pure 
Chemical Industries), 500ngml~' ionomycin (Sigma), and GolgiPlug (BD 
Pharmingen) for 4h, after which they were stained with biotin N418 and PE- 
streptavidin-Cy5 (eBioscience), and fixed and permeabilized as described above. 
The cells were then stained intracellularly with PE-N20 and FITC-MP6-XT22 
(anti- TNF-a). FITC-rat IgG1 (eBioscience) was also included as isotype controls 
for Mac-3 and TNF-«. Samples were analysed on a FACSCalibur using the 
CELLQuest program (Becton Dickinson). 
Immunocytochemical analysis. LP DCs and splenic DCs were fixed and per- 
meabilized with Cytofix/Cytoperm solution (BD Pharmingen). These cells were 
stained with rabbit anti-mouse RALDH1 or RALDH2 (prepared in-house), and 
PE-N-20 (anti-iNOS). For RALDH staining, cells were further stained with 
FITC-conjugated anti-rabbit IgG (Jackson ImmunoResearch) and AlexaFluor 
488-conjugated anti-FITC (Molecular Probes). To detect intracellular NO, LP 
DCs were incubated with 101M diaminofluorescein (DAF)-FM diacetate 
(Daiichi Pure Chemicals) and then stained with PE-N418 (anti-CD11c). 
Stained cells were mounted on slides and analysed by fluorescence microscopy. 
RT-PCR. Specific primer pairs used for RT-PCR were as follows: pIgR sense (5'- 
TTGTTCACGCTCTTGGTAACTG-3’) and plgR antisense (5’'-ACAGGCC- 
TCGGTTACTGGTACC-3’); AID sense (5'-ATATGGACAGCCTTCTGATG- 
AAGC-3') and AID anti-sense (5'-TCAAAATCCCAACATACGAAATGC-3’); 
aGT sense (5’-CTACCATAGGGGAAGATAGCCT-3’) and «GT anti-sense (5'- 
TAATCGTGAATCAGGCAG-3’); TGF-BRI sense (5’-ATCCATCACTAG- 
ATCGCCCT-3’) and TGF-BRI anti-sense (5’-CGATGGATCAGAAGGTA- 
CAAGA-3'); TGF-BRII sense (5’-CGTGTGGAGGAAGAACAACA-3') and 
TGF-BRII anti-sense (5’-TCTCAAACTGCTCTGAGGTG-3’); iNOS sense (5'- 
CGTTGGATTTGGAGCAGAAGTG-3’) and iNOS anti-sense (5’-CATGC- 
AAAATCTCTCCACTGCC-3’); GAPDH sense (5’-ACCACAGTCCATGCC- 
ATCAC-3’) and GAPDH anti-sense (5’-TCCACCACCCTGTTGCTGTA-3’). 
Specific primer pairs used for real-time PCR analysis were as follows: TGF- 
BRIL sense (5'’-CCTACTCTGTCTGTGGATGA-3’) and TGF-BRII anti-sense 
(5’-GCTCGTAATCCTTCACTTCTC-3’); Smad3 sense (5'-GCACAGCCAC- 
CATGAATTAC-3’) and Smad3 anti-sense (5’-GCACCAACACTGGAGG- 
TAGA-3’); Runx3 sense (5'‘-ACCACGAGCCACTTCAGCAG-3’) and Runx3 
anti-sense (5'-CGATGGTGTGGCGCTGTA-3’); APRIL sense (5’-TCACAA- 
TGGGTCAGGTGGTATC-3’) and APRIL anti-sense (5’-TGTAAATGAAA- 
GACACCTGCACTGT-3'); BAFF sense (5'-TGCTATGGGTCATGTCATCCA- 
3’) and BAFF anti-sense (5'’-GGCAGTGTTTTGGGCATATTC-3’); B-actin 
sense (5'-GATGACGATATCGCTGCGCTG-3’) and f-actin anti-sense (5’- 
GTACGACCAGAGGCATACAGG-3’). Relative expression of mRNA was nor- 
malized to B-actin mRNA levels within each sample. 
Adoptive transfer. To examine the IgA productivity of iNOS '~ B cells, naive B 
cells (a mixture of 2 X 10” splenic B cells, 1.5 X 10° MLN B cells, and 3 X 10° PP 
B cells) of WT and iNOS~‘~ mice were intravenously transferred into uMt 
hosts. To examine the function of iNOS /~ T cells, 1 X 10” spleen T cells of 
WT and iNOS ‘~ mice were transferred into Terb/~ Tcrd’~ mice. To examine 
the role of iNOS'’*DCs in IgA production, 2 X 10° of a non-T/B fraction of 
iNOS*“* LP cells, a DC-depleted non-T/B fraction of iN os*/* Lp cells, a non- 
T/B fraction of iNOS~/~ LP cells, and 2 X 10° iNOS*’* LP DCs, iNOS~'~ LP 
DCs, iN OS*’* SPL DCs were transferred into iNOS ‘~ mice. Two to four weeks 
after the transfer, the IgA level was determined by ELISA. To examine the traf- 
ficking and survival of IgA* B cells in iNOS ‘~ hosts, PP IgA B cells (4 x 10° 
cells) or LP IgA* B cells (2 X 10° cells) of B6.SJL (CD45.1* ) mice were intrave- 
nously transferred into iNOS*/*+Rag2~'~ (WT), and iNOS‘ Rag2/~ 
(iNOS /~) hosts (CD45.2*). Sixteen hours (migration of the PP IgA* B cells) 
and 6 days (survival of the LP IgA” B cells) after the transfer, LP cells isolated 
from hosts were stained with FITC-104 (anti-CD45.2) and PE-A20 (anti- 
CD45.1) and analysed on FACSCalibur using the CELLQuest program 
(Becton Dickinson). 
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An IRF8-binding promoter variant and AIRE control 
CHRNAT promiscuous expression in thymus 


Matthieu Giraud’, Richard Taubert®, Claire Vandiedonck’, Xiayi Ke*, Matthieu Lévi-Strauss‘, Franco Pagani’, 
Francisco E. Baralle’, Bruno Eymard®, Christine Tranchant’, Philippe Gajdos®, Angela Vincent’, Nick Willcox’, 


David Beeson’, Bruno Kyewski? & Henri-Jean Garchon' 


Promiscuous expression of tissue-restricted auto-antigens in the 
thymus imposes T-cell tolerance and provides protection from 
autoimmune diseases’ *. Promiscuous expression of a set of self- 
antigens occurs in medullary thymic epithelial cells*’ and is partly 
controlled by the autoimmune regulator (AIRE), a nuclear protein 
for which loss-of-function mutations cause the type 1 autoimmune 
polyendocrine syndrome®’. However, additional factors must be 
involved in the regulation of this promiscuous expression. Here 
we describe a mechanism controlling thymic transcription of a 
prototypic tissue-restricted human auto-antigen gene, CHRNAI. 
This gene encodes the a-subunit of the muscle acetylcholine recep- 
tor, which is the main target of pathogenic auto-antibodies in auto- 
immune myasthenia gravis*”. On re-sequencing the CHRNA1 gene, 
we identified a functional bi-allelic variant in the promoter that is 
associated with early onset of disease in two independent human 
populations (France and United Kingdom). We show that this vari- 
ant prevents binding of interferon regulatory factor 8 (IRF8) and 
abrogates CHRNAI promoter activity in thymic epithelial cells 
in vitro. Notably, both the CHRNAI promoter variant and AIRE 
modulate CHRNAI messenger RNA levels in human medullary 
thymic epithelial cells ex vivo and also in a transactivation assay. 
These findings reveal a critical function of AIRE and the interferon 
signalling pathway in regulating quantitative expression of this 
auto-antigen in the thymus, suggesting that together they set the 
threshold for self-tolerance versus autoimmunity. 

To understand better the relationship between genetic variants, 
thymic auto-antigen expression and autoimmune susceptibility we 


undertook a comprehensive study of the polymorphism of the 
CHRNAI locus. Through re-sequencing in 16 individuals, we iden- 
tified 70 variants that were then genotyped in 45 patients. We selected 
11 tagging single nucleotide polymorphisms (SNPs), 5 of which 
represented allelic groups (A—E) (Supplementary Figs 1 and 2, and 
Supplementary Table 1). In 180 French myasthenia gravis patients 
overall, the allele and genotype frequencies were similar to those in 
151 population-matched controls (Supplementary Table 2), but the 
age of onset of myasthenia gravis was influenced significantly by the B 
SNP group (as shown by multiple linear regression, P= 0.007). 
Exclusion of the upper and lower 5% of onset-age distribution did 
not alter the findings (P = 0.00046), ruling out a bias by extremes. 
Group B included 12 tightly associated SNPs, three of which 
(1s2600683, rs16862847 and 1s2600686) were potentially functional 
but genetically indistinguishable (Table 1 and Supplementary Table 
3). The rs16862847 variant, located 478 nucleotides upstream of the 
translation initiation codon, has the potential to influence acet- 
ylcholine receptor (AChR) expression. Its minor G allele disrupts a 
unique interferon-stimulated response element that is a consensus 
binding motif for IRF8 (similarity score of 0.94 for a threshold of 
0.81) (ref. 10). Previously known as interferon consensus sequence 
binding protein (ICSBP), this transcription factor is involved in sig- 
nalling of interferons and also modulates their expression'’'”. 
Genotyping of rs16862847 in the French data set showed an increased 
minor allele frequency in myasthenia gravis patients with an age at 
disease onset in the lower quartile (odds ratio, 2.35; 95% confidence 
interval (CI), 1.31—4.23; P = 0.0049) compared to the control group, 


Table 1| Association of rs16862847 G with early onset of disease in French and UK myasthenia gravis patients 


Gallele Genotype AG+GG/AA 
Population Group N Frequency Odds ratio (95% Cl) P-value AA AG GG Odds ratio (95% Cl) P-value 
France Pio 50 0.23 2.35 (1.31-4.23) 0.0049 27 23 0 3.3 (1.67-6.52) 0.0008 
Puta 130 0.11 104 24 2 
Cc 151 0.11 120 28 3 
United Kingdom Pig 46 O21 2.01 (1.04-3.87) 0.029 29 15 2 2.08 (0.98-4.4) 0.044 
Puta 104 0.11 87 12 5 
iG 109 0.11 85 23 1 
Combined Pio 96 0.22 2.19 (1.41-3.39) 0.00048 56 38 2 2.66 (1.6-4.41) 0.00015 
Puta 234 0.11 191 36 7 
c 260 0.11 205 51 4 
P_gindicates patients with early onset of myasthenia gravis, defined by age at the onset of disease in the lower quartile of the distribution in each cohort (=21yr in French patients, including 8 
patients of age 21, and $18 yr in UK patients, including 10 patients of age 18). Pura indicates patients in the upper three quartiles of the distribution. C indicates matched controls. Odds ratios were 


determined relative to the control populations. The exact P-values were tested two-sided in the French cohort, and one-sided in the replication UK cohort. The combined data set was tested two- 
sided using stratified 2 x 2 tables. The data also fitted a partially (or semi-) dominant model (Cochran—-Armitage test with strata: P= 4 X 10 *: odds ratio for AG versus AA genotypes, 2.27, 95% Cl, 


1.4-3.69; for GG versus AA genotypes, 5.15, 95% Cl, 1.96-13.6). 


nserm, U580, 75015 Paris, France; Université Paris-Descartes, 75015 Paris, France. 7Neuroscience Group, Weatherhall Institute of Molecular Medicine, Oxford University, 
Headington, Oxford OX3 9DS, UK. *Division of Developmental Immunology, Tumor Immunology Program, German Cancer Research Centre, INF 280, D-69120 Heidelberg, Germany. 
*Centre for Integrated Genomic Medical Research (CIGMR) and Arthritis Research Campaign (ARC) Epidemiology Unit, University of Manchester, Manchester M13 9PT, UK. 
5International Centre for Genetic Engineering and Biotechnology Padriciano 99, 34012 Trieste, Italy. “Institut de Myologie, Hépital de la Salpétriére, 75013 Paris, France. Service de 
Neurologie, Hépital Civil, CHRU, 67091 Strasbourg, France. 8Service de Réanimation, Hépital Raymond Poincaré, 92380 Garches, France. 
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confirming the relationship to age at onset (Table 1). This association 
was replicated in a second series of 150 myasthenia gravis patients 
and 109 controls from the United Kingdom (t-test, P= 0.0058 and 
Table 1). Combining both data sets increased the evidence for an 
association of rs16862847 with an early onset of the disease 
(P= 1.5 X 10 *; common odds ratio, 2.66; 95% CI, 1.6-4.41). 

To investigate the functional significance of rs16862847, we per- 
formed an electrophoretic mobility shift assay (EMSA) on radiolabel- 
led oligonucleotides with nuclear extracts from the murine RAW264.7 
macrophage cell line; this represents the myelo-monocytic lineage 
where expression of IRF8 was first described'’. We found an allele- 
dependent molecular complex characterized by one or two bands 
(first two lanes, Fig. la). The upper band was specific for the major 
A allele whereas the lower one was shared by both rs16862847 alleles. 
Competition with unlabelled oligonucleotide or an irrelevant probe 
(nuclear factor of activated T-cells, NFAT) confirmed the specificity. 
Moreover, the A-allele-specific band bound IRF8, as it was super- 
shifted by an anti-IRF8 antibody but not by an irrelevant anti-IRF1 
antibody (third lane, Fig. 1b). 

Among the other two candidate SNPs of group B, rs2600686 was 
situated in the third intron of CHRNAI and potentially created an 
intronic splicing processing element". However, we found no evid- 
ence for a functional role of this variant (Supplementary Fig. 3). For 
the third potentially functional variant, rs2600683, the minor C allele 
created a DNA-binding motif for NFAT located 1,353 base pairs (bp) 
upstream of the CHRNAI translation initiation site. This motif was 
redundant but nonetheless an EMSA showed allele-specific binding 
of nuclear extracts (Supplementary Fig. 4). 

To discriminate between the effects of rs2600683 and rs16862847 
alleles on CHRNAI promoter activity, we tested the four possible 
allele combinations within the promoter (Fig. 2a) by transfection 
of reporter gene constructs into RAW264.7 cells and also a thymic 
epithelial cell (TEC) line derived from human neonatal thymic epi- 
thelium'’. Both cell lines expressed IRF8, as verified by polymerase 
chain reaction with reverse transcription (RT-PCR) and western 
blotting (data not shown), and yielded similar findings. Figure 2 
shows the results with the TEC line. Only rs16862847 allele variation 
influenced expression of the reporter luciferase gene; the minor G 
allele yielded a fivefold lower relative luciferase activity compared 
with the major A allele (P= 1.5 X 10“ for the comparison of P1 
versus P3 promoter activities with the Mann—Whitney U-test; 
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Figure 1 | rs16862847 allele variation affects IRF8 binding. a, EMSA. 
Nuclear extracts prepared from RAW264.7 macrophage cells were incubated 
with *?P-labelled oligonucleotide probes corresponding to either allele of 
1s16862847, in the presence or absence of an excess of competing unlabelled 
oligonucleotide. An NFAT probe was used as an irrelevant competitor. The 
disease-accelerating G allele lacks the upper band of the complex. 

b, Supershift assay. Addition of an antibody against IRF8 but not against 
IRF1 clears the A-allele-specific band and yields a supershifted band. 
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Fig. 2b), similar to the background level of the promoterless con- 
struct. Notably, this allele-specific expression depended on IRF8, as 
shown by RNA interference (Fig. 2c): co-transfection with the P1 
construct and short interfering (si)RNAs targeting IRF8 decreased 
luciferase expression to the level of an anti-luciferase siRNA pool. 
Moreover, treatment of the cells with interferon (IFN)-y upregulated 
the luciferase activity by a factor of 1.57 (P= 6 X 107“), but only in 
the presence of the major allele of rs16862847 (Fig. 2d). We verified 
the mRNA expression from the endogenous CHRNAI gene in the 
TEC line that was homozygous for the major A allele of rs16862847 
(data not shown); using chromatin immunoprecipitation, we 
showed specific binding of IRF8 to the promoter at the CHRNAI 
locus (P<0.05), but not to an irrelevant promoter (GAPDH) 
(Fig. 2e). Thus, in TECs, IRF8 binds to the CHRNAI promoter 
carrying the rs16862847 A allele and enhances its transcriptional 
activity. 

Next, we investigated the expression of CHRNA1 ex vivoin relation 
to rs16862847. We also tested for an influence of AIRE. Messenger 
RNA levels of both genes were determined in samples of medullary 
(m)TECs freshly purified from the thymi of 25 (non-autoimmune) 
infants undergoing thoracic surgery. The levels varied among indi- 
viduals but overall were closely correlated (Spearman’s R= 0.71, 
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Figure 2 | Characterization of IRF8-dependent CHRNAT transcriptional 
activity in the TEC line. a, Disease-associated alleles (asterisks) at rs2600683 
and rs16862847 sites were introduced by site-directed mutagenesis of P1, the 
common wild-type promoter, to generate P2—P4 constructs. b, Assay of 
promoter activities by relative luciferase activity. c, Effect of siRNA 
knockdown of IRF8 on P1 promoter activity. d, Specific effect of IFN-y on P1 
construct expression. For b-d, data represent mean + s.e.m. of eight 
transfections for each plasmid or condition. e, Binding of IRF8 to the 
endogenous CHRNAI promoter detected by ChIP. The GAPDH promoter 
was analysed as a negative control for IRF8. Data represent mean + s.e.m. of 
quadruplicate real-time PCR. 
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P=7%X 10”; Fig. 3a, black line), suggesting a role for AIRE in con- 
trolling CHRNAI transcription. A similar relationship was found in 
murine mTECs (Supplementary Fig. 5). The correlation in human 
mTECs was preserved in subjects homozygous for the rs16862847 A 
allele (Spearman’s R = 0.87, P= 1 X 107”) but was not significant in 
those with one or two G alleles. Moreover, G-allele-carrying indivi- 
duals showed a 1.8-fold lower mean level of CHRNA1 mRNA com- 
pared with AA homozygotes (Fig. 3b; P = 0.076 for Student’s t-test 
and P= 0.0062 when adjusted for AIRE level using a linear model). 
Notably, IRF8 mRNA was expressed in both cortical and medullary 
TECs, displayed little inter-individual variation, and did not correl- 
ate with AIRE or CHRNAI mRNA levels (Supplementary Fig. 6). 

To confirm the relationship between AIRE and CHRNAI mRNA 
expression and the involvement of rs16862847, we conducted co- 
transfection experiments in the TEC line using an AIRE expression 
plasmid and the CHRNA1 promoter luciferase reporter constructs. 
We found a strong influence of AIRE on transcription from the 
CHRNAI promoter for both rs16862847 alleles (Fig. 3c). Luciferase 
expression first increased almost linearly as a function of the amount 
of the AJRE expression plasmid transfected and then reached a plat- 
eau (P<1X10 !°), in contrast to the continuous increase of the 
AIRE message (Supplementary Fig. 7). Nevertheless, there was also a 
highly significant effect of rs16862847 variation (P< 1X10 '°), as 
reflected by a twofold decrease of luciferase expression with the G 
allele compared to A at all doses of AIRE plasmid. Moreover, treat- 
ment with siRNA targeting IRF8 significantly decreased the response 
of the promoter with the A allele but not with the G allele, indicating 
that IRF8 accounts for a large part of the allelic difference in AIRE- 
driven CHRNA1I promoter activity (Fig. 3d). 

The molecular mechanisms controlling the promiscuous express- 
ion of tissue-restricted auto-antigens in TECs are presently ill-defined 
and expression seems to be regulated at several levels’. After the 
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Figure 3 | Effect of rs16862847 and AIRE on CHRNA1 expression levels in 
mTECs. a, Correlation between CHRNAI and AIRE message levels in all 
purified human mTEC samples (black line), in rs16862847 AA homozygotes 
(blue line, diamonds) or in rs16862847 AG or rs16862847 GG samples 
(orange line, triangles). b, Decreased CHRNA1 expression associated with 
1816862847 G allele. Horizontal bars show medians of relative expression, 8.37 
for AA and 4.75 for AG or GG (P = 0.0062, adjusted for AIRE level). ¢, In vitro 
assay of the transactivation property of AIRE on CHRNAI P1 and P3 
promoter constructs. d, RNAi knockdown of IRF8 affects AIRE-stimulated 
activity of the CHRNA1 promoter with rs16862847 A allele. For ¢ and d, data 
represent mean = s.e.m. of four transfections for each condition. 
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discovery of AIRE, our results outline a molecular framework for 
the AIRE-dependent transcriptional control of such an auto-antigen 
in TECs. Several mechanisms of action have previously been consid- 
ered for AIRE. First, direct binding of AIRE to cis-acting DNA motifs; 
second, formation of higher order transcriptional complexes (with- 
out DNA binding); and third, modification of chromatin structure. 
Recent data (for discussion, see ref. 16) and our present findings are 
most consistent with the second mode. AIRE together with IRF8 
might constitute part of a higher order transcriptional complex; by 
decreasing the binding of such a complex to the CHRNA1I promoter, 
the rs16862847 minor allele would also affect the co-transcriptional 
activity of AIRE and weaken the correlation between AIRE and 
CHRNAI expression levels in mTECs (Fig. 3a, c). This does not 
exclude an additional effect of AIRE that is independent of IRF8, as 
increasing doses of AIRE also upregulate CHRNAI transcription in 
the minor allele in vitro (Fig. 3c, d). Alternatively, the role of AIRE 
could be more indirect. On the basis of the recent finding that AIRE in 
cooperation with CREB-binding protein is able to activate transcrip- 
tion of IFN-B'”"’, it is conceivable that a member of the interferon 
family under the control of AIRE might induce IRF8 expression (see 
Supplementary Discussion). 

Although the difference in mean thymic CHRNA1 mRNA levels 
between individuals carrying the A and the G alleles was less than 
twofold, similar differences between diabetes-resistant and diabetes- 
susceptible insulin variable number tandem repeat (VNTR) alleles 
are important’”’. Such sensitivity of central tolerance to relatively 
modest variations of antigen dose has also been described in animal 
models of autoimmunity**~ and has been explained by the threshold 
model of tolerance’ (see Supplementary Discussion). Thus, our data 
demonstrate dual control of thymic expression of an auto-antigen 
and highlight the role of central tolerance in protection from organ- 
specific autoimmunity in humans. How the genetic variant influ- 
ences the onset of myasthenia gravis requires further investigation. 


METHODS SUMMARY 

Patients and controls. A total of 330 unrelated Caucasian patients from two 
independent cohorts (180 French and 150 UK patients) were included in the 
study with their written informed consent and in keeping with the tenets of the 
Declaration of Helsinki. They fulfilled clinical and electro-myographical criteria 
for the diagnosis of acquired generalized myasthenia gravis. They had confirmed 
positive titres of anti-AChR auto-antibodies (>0.6nM), determined by radio- 
immunoprecipitation. Only non-thymoma patients were included. The control 
groups matched both populations and included 151 French and 109 UK volun- 
teering Caucasian blood donors. They were not matched for gender; however, 
gender did not influence the genotype distributions (Supplementary Table 4). 
Molecular methods and data analysis. Genetic analysis of the CHRNAI locus, 
EMSA protocols, cloning of the CHRNAI promoter and reporter gene assays, 
RNAi experiments, chromatin immunoprecipitation (ChIP), preparation and 
purification of medullary thymic epithelial cells, quantitative RT-PCR, and 
statistical methods are described in detail in the Methods. For transfections 
and ChIP, the data shown are representative of at least three experiments with 
similar results. For ChIP, independent chromatin preparations were used in each 
experiment. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Polymorphism detection, genotyping and SNP tagging. We identified poly- 
morphisms in CHRNAI by direct re-sequencing of genomic DNA from 13 
French myasthenia gravis patients and 3 controls. This sample of 32 chromo- 
somes provided a power of 80.1% to detect variants with a minor allele frequency 
of 0.05 at the 5% level of significance. Primers were selected with Primer3 (http:// 
frodo.wi.mit.edu/cgi-bin/primer3/primer3_www.cgi) to define 15 overlapping 
amplicons with a length up to 3 kb. These amplicons encompassed a genomic 
segment of 19.9kb, including 3.3kb upstream and 0.3 kb downstream of the 
CHRNAI gene (Supplementary Fig. la). Each amplicon was sequenced using 
both amplification and internal primers. Reaction products were migrated on 
an ABI 3100 automated sequencer (Applied Biosystems). We analysed sequen- 
cing chromatograms with the Phred/Phrap/Consed suite”. Polymorphisms 
were detected with Polyphred and verified manually. SNPs were genotyped 
by multiplexing single-base chain extension reactions, using the SNaPshot sys- 
tem (Applied Biosystems). Insertions/deletions were genotyped by direct re- 
sequencing. This first step led us to identify 70 polymorphisms, 31 of which 
were new (Supplementary Table 1). 

We then genotyped these 70 variants in 32 additional patients (45 patients in 
total). Of the 70 variants, 43 had a minor allele frequency >0.05. Tagging SNPs 
were selected using IdSelect”” with an r threshold at 0.75 and minor allele 
frequency threshold at 5%. Eleven tagSNPs were selected to represent five linkage 
disequilibrium (LD) groups, denoted A to E, and six independent SNPs (iden- 
tified with a hash in Supplementary Table 1 and in Supplementary Fig. 2). 

Finally, we genotyped the 11 tagSNPs in 180 French myasthenia gravis patients 
and 151 population-matched controls. Their genotypic frequencies did not 
deviate from Hardy—Weinberg proportions in either patients or controls. In 
the UK cohort, rs16862847 was genotyped by PCR-RELP, through a BsfXI diges- 
tion of a 126-bp fragment amplified with a forward primer 5'-TCAGCCT- 
AACACTGAAGCCAAGGAAATG-3’ and a reverse primer: 5’-TGTGGGTT- 
GACTTGATTAGGG-3’. 

Prediction of DNA-binding motifs. We used the MatInspector program from 
Genomatix (http://www.genomatix.de) to predict and compare DNA-binding 
motifs in promoter sequences with alternative alleles, and ModelInspector to list 
promoters including an interferon-stimulated response element. In intronic 
sequences we looked for regulatory elements in the literature, at splice site 
prediction of the Berkeley Drosophila Genome Project (http://www.fruitfly. 
org/seq_tools/splice.html) and at the alternative splicing workbench of the 
European Bioinformatics Institute (http://www.ebi.ac.uk/asd-srv/wb.cgi). 
EMSA. We carried out preparation of nuclear extract from RAW264.7 cells and 
EMSA using the NucBuster Protein Extraction Kit and the EMSA Accessory Kit 
(Novagen). rs16862847 and rs2600683 probes were made by annealing of over- 
lapping complementary DNA oligo-nucleotides. rs16862847 G allele, forward 
5’-GTGAAGCAGAGGAAATGGAACTGAAGCCATGT-3’, reverse 5’-GGAAA- 
CATGGCTTCAGTTCCATTTCCTCTGCT-3’; 1s16862847 A allele, forward 5'- 
GTGAAGCAGAGGAAATGAAACTGAAGCCATGT-3’, reverse 5'-GGAAACA- 
TGGCTTCAGTTTCATTTCCTCIGCT-3’; rs2600683 C allele, forward 5’- 
GTGAGAACATTTTTCCACTCATGCCACT-3’, reverse 5'~GGCCAGTGGCA- 
TGAGTGGAAAAATGTTC-3’; 152600683 G allele, forward 5’-GTGAGAAC- 
ATTTTTCGACTCATGCCACT-3’, reverse 5’-GGCCAGTGGCATGAGTCG- 
AAAAATGTTC-3’. Underlined nucleotides correspond to the rs2600683 and 
rs16862847 alleles and italic sequences to the IRF8- and NFAT-binding 
motifs. We labelled probe overhangs with [o-**P]dCTP. We incubated 
RAW264.7 nuclear extracts with 10 fmol of labelled probe for 30 min at 4 °C. 
For competition experiments, before adding to the extract, we mixed the labelled 
probe with a 500-fold or 1,500-fold molar excess of unlabelled probe or alter- 
natively the NFAT probe 5’-CGCCCAAAGAGGAAAATTTGTTTCATA-3' 
(Santa Cruz Biotechnology). For supershift assays, we added 2 pg of antibodies 
against IRF8 (C-19) or IRF1 (M-20) (Santa Cruz Biotechnology) for 20 min after 
incubation with the labelled probe. We separated protein-DNA complexes on a 
non-denaturing 6% Novex retardation gel (Invitrogen) in 0.5 TBE buffer. The 
gel was dried and auto-radiographed. 

Promoter cloning and reporter gene assays. We amplified a fragment corres- 
ponding to nucleotides —3315 to —1 (relative to the ATG start codon +1) by 
PCR using the PfuUltra Hotstart DNA polymerase (Stratagene) of one indi- 
vidual (ID05 on Supplementary Fig. 2). This individual was homozygous for 
the upstream region common haplotype (44%) characterized by common 
alleles at each polymorphism position, including G for rs2600683 and A for 
rs16862847. We used primers tailed with Nhel and Xhol restriction sites to 
sub-clone the fragment into the Nhel and Xhol sites of the pGL3-Basic vector 
(Promega). Three vectors were derived by site-directed mutagenesis with the 
QuickChange II XL site-directed mutagenesis kit (Stratagene), inserting the 
minor allele, C for rs2600683, G for rs16862847 or both. We verified the four 
constructs by DNA sequencing. Maxipreps of each were made using the 
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EndoFree plasmid maxi kit (Qiagen). Transient transfection assays were con- 
ducted in a human thymic epithelial cell (TEC) line (clone P1.4D6)"° and in a 
murine macrophage cell line (RAW264.7) using the Lipofectamine 2000 trans- 
fection reagent (Invitrogen), in 96-well plates with 4-8 replicate wells per con- 
dition per experiment in at least three independent experiments. Transfections 
were carried out with 200 ng of the expression plasmid and 4 ng of phRL-SV40 
vector (Promega) (internal control for transfection efficiency) in serum-free 
RPMI medium. Serum was added at a final 10% concentration after an overnight 
reaction. After a 24-h transfection period, we collected cells and measured luci- 
ferase activity using the Dual-Luciferase Reporter Assay System (Promega). 

For interferon activation, we incubated the transfected cells with 10 ng ml ‘of 
IFN-y (R&D systems) for another 24 h. Promoter activity is expressed as the ratio 
of firefly to Renilla luciferase values. 

For co-transfection experiments, we used the AJRE-expression plasmid 
pcDNA3.1-AIRE” provided by P. Peterson. Increasing doses of AIRE plasmid 
were mixed with the P1 or P3 construct and with appropriate amounts of an 
irrelevant plasmid to keep the total amount of transfected DNA constant. Mann— 
Whitney U-tests were used to assess the significance of differences between 
constructs. 

RNA interference experiments. To investigate effects of IRF8 on CHRNAI 
expression, we used a pool of four specific siRNA duplexes for IRF8 designed 
to minimize off-target effects (On-TargetPlus Smartpool, Dharmacon; see 
below) as well as non-targeting duplexes, RISC-free duplexes (negative controls) 
and firefly-luciferase-targeting duplexes (positive control). Short interfering 
RNAs were co-transfected with the different constructs at a final concentration 
of 100 nM, as recommended by the manufacturer. Luciferase activity was assayed 
as above after a 48-h transfection period. Inhibition of IRF8 expression was 
monitored by immunoblot analysis of cells extracts, using anti-IRF8 antibody 
(C19, Santa-Cruz Biotechnology) and a peroxidase-coupled secondary antibody 
with ECL-plus luminescent substrate (Amersham), showing a 37% average 
decrease. The sequences of the sense and anti-sense oligonucleotides forming 
the four duplexes of the anti-IRF8 pool are: duplex 1, 5'-GAGGUUACGC- 
UGUGCUUUGUU-3’ and 5’-CAAAGCACAGCGUAACCUCUU-3’; duplex 
2, 5'’-CCGGAUGUUUCCAGAUAUUUU-3’ and 5'-AAUAUCUGGAAACA- 
UCCGGUU-3’; duplex 3, 5’-CAUACAAAGUUUACCGAAUUU-3’ and 5'- 
AUUCGGUAAACUUUGUAUGUU-3’; duplex 4, 5’-AGCCUUCUGUGGA- 
CGAUUAUU-3’ and 5’-UAAUCGUCCACAGAAGGCUUU-3". 

ChIP assay. ChIP was performed using the ChIP-IT kit (Active Motif) according 
to the manufacturer’s instructions with slight modifications. Briefly, cells were 
fixed with 1% formaldehyde for 30 min at room temperature. After neutraliza- 
tion, cells were lysed and the extracts were sonicated (20s each, 10 times at 25% 
power using a Vibracell VC100 sonicator) to shear the chromatin into 200- 
1,000-bp fragments. The resulting sheared chromatin was pre-cleared with sal- 
mon sperm DNA/protein G agarose beads and incubated with 5 1g of normal 
goat immunoglobulin G (IgG) or goat anti-IRF8 antibody (C-19, Santa-Cruz 
Biotechnology) overnight at 4 °C with rotation. Before incubation, a fraction of 
the pre-cleared chromatin was saved as an input control. Antibody/chromatin 
complexes were precipitated by addition of salmon sperm DNA/protein G agar- 
ose beads and eluted after extensive washing of the beads. Immunoprecipitated 
DNA was purified following reversal of cross-linking and digestion of RNA and 
proteins with RNase A and proteinase K, respectively. Levels of precipitated 
promoter DNA (ChIP signals) were quantified by quadruplicate real-time 
PCR and shown as values relative to the input DNA. The following primer pairs 
were used (sense and anti-sense, respectively): CHRNAI1 promoter 5'- 
CTGAAATCCAGGCAGTCAGA-3' and 5'-CTGCAAACATGGCTTCAGTT- 
3'; GAPDH promoter (irrelevant control), 5'-TACTAGCGGTTTTACGG- 
GCG-3' and 5'-TCGAACAGGAGGAGCAGAGAGCGA-3’. 

Isolation of hTECs, RNA isolation and cDNA synthesis. Samples of thymic 
tissue from 25 patients (7 females and 18 males; age range from 3 days to 3 yr) 
were obtained in the course of corrective cardiac surgery at the Department of 
Cardiac Surgery, Medical School, University of Heidelberg. None of the patients 
exhibited symptoms of an autoimmune disorder at the time undergoing treat- 
ment. This study has been approved by the Ethics Committee of the Medical 
School of the University of Heidelberg. 

Thymic epithelial cell (TEC) purification, RNA isolation and cDNA synthesis 
were performed as described previously with the following modifications. 
TECs were enriched on a discontinuous Percoll density gradient after sequential 
digestion of thymic tissue by collagenase-dispase and trypsin. Typical flow cyto- 
metry plots used for sorting TECs are shown in Supplementary Fig. 8. The cDNA 
was purified by centrifugation through MicroSpinG-50 columns (Amersham). 
Quantitative PCR. Real time PCR reactions were performed in final volumes of 
25 ul with the optimal concentration of the forward and reverse primers (50— 
900 nM) using the qPCR core kit for SybrGreen I (Eurogentec) containing Hot 
GoldStar polymerase and uracil-DNA glycosylase. Reactions were run on a 
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sequence detection system (GeneAmp 5700; Applied Biosystems) in triplicate 
and expression values were normalized to GAPDH expression using the com- 
parative Cy method. Each measurement was done twice. Primers were purchased 
from MWG Biotech AG, and when possible, were designed to span at least one 
intron. The following primer pairs were used (sense and anti-sense, respectively): 
AIRE, 5'-GAACGGGATTCAGACCATGT-3' and 5'-TCTTCGAACTTGCTGG- 
GAGT-3'; CHRNAI, 5’-TCGTCACCACTTTCCCTTT-3’ and 5’-CCGCTC- 
TCCATGAAGTTGCT-3’; GAPDH, 5'-TCGACAGTCAGCCGCATCT-3’ and 
5'-CCGTTGACTCCGACCTTCA-3’, IRF8, 5'-CGAGCCATACAAAGTTTAC- 
CGAAT-3' and 5'-TCACGCAGCCAGCAGTTG-3’. 

Statistical analysis. The age distributions at disease onset were normalized by 
square root transformation (Shapiro—Wilk’s W test not significant). The effect of 
the 11 tagSNPs on this quantitative trait was tested in the French cohort by 
forward stepwise multiple linear regression. The influence of rs16862847 on this 
same trait in the UK cohort was tested by a Student’s t-test. Homogeneity of 
variances in patients with or without the risk allele was verified (Levene’s test and 
Brown-Forsythe test not significant). We also ruled out a bias by potential out- 
liers by removing the upper and lower five percentiles of the distribution, alto- 
gether making the finding robust. Calculations of the statistics and of P-values 
were carried out with the Statistica 6.0 software (Statsoft Inc.). Odds ratios, their 
95% confidence interval (CI), the exact P-values of 2 X 2 tables and stratified 
2 X 2 tables corresponding to the two cohorts of patients were calculated with 
the StatXact software v7 (Cytel corp). 

The analysis of quantitative PCR data was based on AC; values. We tested a 
correlation between AIRE and CHRNAI message levels with the Spearman rank 
R coefficient. The linear model analysis with the factors rs16862847 and AIRE 
was performed with PROC GLM of SAS version 8.2 (SAS Institute Inc.). 
Variances of CHRNAI residuals after removal of AIRE-correlated variations 
were homogeneous in rs16862847 genotypic groups (AA versus AG+GG). 
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sequencer traces using phred. |. Accuracy assessment. Genome Res. 8, 175-185 
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A central integrator of transcription networks in plant 
stress and energy signalling 


Elena Baena-Gonzalez’’, Filip Rolland’***, Johan M. Thevelein?”’ & Jen Sheen’ 


Photosynthetic plants are the principal solar energy converter 
sustaining life on Earth. Despite its fundamental importance, little 
is known about how plants sense and adapt to darkness in the daily 
light-dark cycle, or how they adapt to unpredictable environ- 
mental stresses that compromise photosynthesis and respiration 
and deplete energy supplies. Current models emphasize diverse 
stress perception and signalling mechanisms’*. Using a com- 
bination of cellular and systems screens, we show here that the 
evolutionarily conserved Arabidopsis thaliana protein kinases, 
KIN10 and KINI1 (also known as AKIN10/At3g01090 and 
AKIN11/At3g29160, respectively), control convergent repro- 
gramming of transcription in response to seemingly unrelated 
darkness, sugar and stress conditions. Sensing and signalling 
deprivation of sugar and energy, KIN10 targets a remarkably 
broad array of genes that orchestrate transcription networks, pro- 
mote catabolism and suppress anabolism. Specific bZIP trans- 
cription factors partially mediate primary KIN10 signalling. 
Transgenic KIN10 overexpression confers enhanced starvation 
tolerance and lifespan extension, and alters architecture and 
developmental transitions. Significantly, double kin10 kin11 defi- 
ciency abrogates the transcriptional switch in darkness and stress 
signalling, and impairs starch mobilization at night and growth. 
These studies uncover surprisingly pivotal roles of KIN10/11 in 
linking stress, sugar and developmental signals to globally regulate 
plant metabolism, energy balance, growth and survival. In con- 
trast to the prevailing view that sucrose activates plant SnRK1s 
(Snfl-related protein kinases)**, our functional analyses of 
Arabidopsis KIN10/11 provide compelling evidence that SnRK1s 
are inactivated by sugars and share central roles with the ortholo- 
gous yeast Snfl and mammalian AMPK in energy signalling. 

We surveyed the public transcriptome databases using the gnome 
tool Genevestigator’ and discovered that dark-induced (DIN) genes 
are activated under diverse stress conditions and repressed by sugars 
and light (Supplementary Fig. 2a). Consistently, quantitative 
analyses confirmed that DIN1 (At4g35770) and DIN6 (At3g47340) 
were activated by various stresses—such as darkness, herbicide (3- 
(3,4-dichlorophenyl)-1,1-dimethylurea, DCMU)* and_ flooding/ 
submergence—that limit photosynthesis and respiration, and hence 
sugar and energy supplies in isolated mesophyll cells and whole 
plants. This activation was antagonized by both glucose and sucrose 
(Fig. la—c, and Supplementary Fig. 2b). The results suggest that plant 
cells sense sugar or energy depletion in a cell-autonomous manner 
and trigger convergent gene activation in response to diverse and 
seemingly unrelated stress signals independent of light. 

We have previously shown that the Arabidopsis glucose sensor 
hexokinase 1 (HXK1) is essential for the glucose repression of 
photosynthesis genes’. However, DIN gene repression by glucose in 


the dark was still active in the HXK1-null mutant gin2 (glucose- 
insensitive2) (Fig. lc), indicating the involvement of an HXK1- 
independent signalling pathway’. To identify the key regulators 
governing this stress/sugar signalling pathway, we developed a cell 
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Figure 1| DIN genes are activated by diverse stresses and repressed by 
sugars. Stress induces DIN gene expression in Arabidopsis leaves (a), and 
isolated mesophyll cells (b). The induction is diminished by glucose (G) and 
sucrose (S). ¢, The gin2 mutant exhibits normal DIN gene induction and 
sugar repression. d, Diverse stresses activate the DIN6-LUC reporter. 
Glucose antagonizes DIN6-LUC activation. e, DIN6-LUC is specifically 
activated by SnRK1s. f, DIN6-LUC induction requires KIN10 kinase activity 
(K48) and T-loop phosphorylation (T175). D, dark; H, hypoxia; C, control 
DNA. Control, DCMU and hypoxia treatments were performed under light 
(50 uumol ms), Error bars, s.d. (1 = 3). Protein expression was 
confirmed by western blot analysis. 
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model system using a sensitive reporter, generated by fusing the 
putative promoter of Arabidopsis DIN6 (encoding the glutamine- 
dependent asparagine synthetase, ASN1) to the luciferase (LUC) gene 
(Methods Summary). In transfected mesophyll protoplasts, DIN6- 
LUC was activated by darkness, DCMU, hypoxia/submergence and 
other stresses within 3-6h (Fig. 1d, and J.S., unpublished), in a 
similar way to the endogenous DIN6 gene®*''”’ (Fig. la—c; and 
Supplementary Fig. 2a). This activation was repressed by sucrose 
or glucose at physiological concentrations as low as 5 mM (Supple- 
mentary Fig. 2b). Interestingly, endogenous DIN6 activation by 
diverse signals was abolished by the protein kinase inhibitor K252a 
(Supplementary Fig. 2c), suggesting the requirement of protein 
kinases in the integration of stress and sugar signals. 

We first investigated the involvement of members of the Snfl- 
related kinase (SnRK) gene families’ (Supplementary Fig. 3a), impli- 
cated in metabolic regulation owing to their sequence homology to 
the yeast Snfl and mammalian AMPKs*"*". Cell-based functional 
screens revealed the specific activation of DIN6-LUC by the two 
ubiquitously expressed members of the SnRK1 group, KIN10 and 
KIN11 (Fig. le, and Supplementary Fig. 3b, c). Representative 
SnRK2 and SnRK3 members lacked the same function (Fig. le) des- 
pite their established protein kinase activities in osmotic and salt 
stress responses (Methods Summary). Expression of KIN10 and 
KIN11 in protoplasts conferred significant kinase activity, which 
required the conserved ATP binding site (K48 and K49) and T175 
(ref. 14; Supplementary Fig. 4, and Fig. 1f). However, unlike T172 
phosphorylation in AMPK, T175 phosphorylation in KIN10 was not 
correlated with activation by dark, DCMU or hypoxia treatment 
(data not shown). Thus, DIN6-LUC served as a sensitive and quant- 
itative transcription reporter to monitor endogenous KIN10/11 
activity in vivo. 

To identify critical DNA sequences involved in the stress and 
KIN10-mediated responses, we performed systematic mutagenesis 
of predicted cis-regulatory elements (Supplementary Methods) in 
the DIN6 promoter (Supplementary Fig. 5, and Fig. 2a). Signifi- 
cantly, specific mutation of the G-box (CACGTG, G1) proximal to 
the TATA box abolished most of the DIN6-LUC activation by KIN10, 
hypoxia, darkness and DCMU (Fig. 2b-d). The results indicate that a 
common cis-element mediates convergence of diverse signals, most 
probably through KIN10. 

To substantiate this finding further, we screened Arabidopsis bZIP 
transcription factors'® that have been characterized as G-box binding 
factors (GBFs), and found that GBF5/bZIP2 (At2g18160) specifically 
activated DIN6-LUC (Fig. 2e). Importantly, co-expression of KIN10 
with GBF5 resulted in a dramatic and synergistic effect on DIN6-LUC 
activation. Both the GBF5 effect and its synergism with KIN10 were 
abolished by the specific DIN6 promoter mutation in the G-box gl 
(Fig. 2f). Similar results were obtained with closely related bZIP 
S-group members expressed in mesophyll cells (bZIP11/ATB2, 
At4g34590; bZIP53, At3g62420; and related bZIP1, At5g49450; 
Supplementary Table 1 and Fig. 2g, h). Functional redundancy of 
the transcription factors was evident also from the lack of overt 
phenotypes in the available loss-of-function single bzip mutants 
(data not shown). Because the S- and C-group bZIPs have been 
shown to form functional heterodimers'®, we also examined the 
activity of C-group bZIPs. Interestingly, one member of the 
C-group, bZIP63, showed little activity alone but exhibited moderate 
synergism with KIN10 through the G-box (Fig. 2h). These studies 
identified novel functions of specific GBFs/bZIPs and provide further 
evidence for convergent responses in stress signalling through KIN10 
and KINI1. 

To explore the extent of KIN10 transcriptional regulation and 
identify its early target genes, we scaled-up transient protoplast 
expression experiments, which could rule out secondary or long- 
term effects of metabolism and growth, and circumvent limitations 
caused by redundancy and embryonic lethality observed in mammals 
and plants'*'”’*. Gene expression profiles with or without KIN10 
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expression were compared using Arabidopsis whole-genome ATH1 
GeneChips (GEO accession number GSE8257; Supplementary 
Methods). To filter candidate KIN10 target genes (at least twofold 
change with Pvalue < 0.0004), we combined independent experi- 
mental validation using quantitative real-time reverse transcriptase 
PCR (qRT-PCR) with vigorous statistical analyses of GeneChip data 
from biological replicates, using parametric and non-parametric 
methods (see Supplementary Methods and Supplementary Fig. 6 
for details). Guided by the experimentally validated MYB75 trans- 
cription factor gene and 24 other marker genes, we were able to define 
1,021 putative KIN10 target genes that exhibited reproducible activa- 
tion or repression (Supplementary Tables 1-3). 

To increase robustness and assess the physiological significance of 
the co-regulation of KIN10 target genes, we performed additional 
computational screens to identify seven highly correlated gene 
expression patterns from published data sets generated with the 
Arabidopsis ATH1 GeneChips''*'?* (details in Supplementary 


a DIN6 promoter mutants b a6 
555 4 w KINI 
—402 Messenger 


RNA 


"SF EHTS HM 


D2 
c d 12 
30) Control Control 
Hypoxia Hypoxia 
2 20 38 Dark 
a o) DCMU 
5 3 
> 10 | j | 34 
0: - | J ==, 0! a 
LP CBH SGI WT gl 
e f 
16 
a) ® 200) WT gt 
5 =) 
© © 
6 8 S 100 
3 = 
ae ee 0 =o aiaianias 
Coe’ oct of? o¢h o¢% CEWw GO WOO 
Ce Ge GO GO GP Les SRS 
oe ees 
Anti-HA —__ — & & 
RC a a a 
g h 
12 —KIN10 120 : +KIN10 
I 7) 
28 ; 2 80 
oO —d 
CS 4 * S 40 : 
5 a : 
0 (ane 0 a -__ i 
“LEE LIME “SELLE CLE 
aS NS AY AS a te) 
GOWN 4 LVN COTM YN ONLY 


Figure 2 | Convergent transcriptional control through a G-box and specific 
GBFs. a—d, Mutational analysis of the DIN6 promoter. G, G-box; T, 
TATCCA element; D, DOF-binding cis-element; C, C-box/ACGT/OSAMY3. 
Numbers are relative to ATG. The g/ mutation prevents activation by KIN10 
(b), hypoxia (c), dark and DCMU (d). e, DIN6-LUC is specifically induced 
by GBES. f, KIN10 and GBF5 synergistically activate DIN6-LUC through the 
G1 box. g, Functional redundancy of S-group bZIPs (grey) in DIN6-LUC 
activation. h, Synergistic activation of DIN6-LUC by KIN10 and S-group 
(grey) and C-group (black; bZIP63) bZIPs. WT, wild-type DIN6 promoter; 
C, control DNA. Control, DCMU, and hypoxia treatments were performed 
under light. Error bars, s.d. (n = 3). 
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Methods). Specifically, gene expression profiles generated under 
various sugar and energy starvation conditions'!”°” revealed a strik- 
ing positive correlation with KIN10 target genes (Pearson correlation 
coefficient 0.85 to 0.87, Supplementary Table 5). Importantly, KIN10 
target genes also exhibited strong negative correlations (Pearson 
correlation coefficient —0.87 to —0.92, Supplementary Table 5) with 
the gene expression profiles obtained from glucose- or sucrose- 
treated seedlings and differential CO>-fixing adult leaves in intact 
plants'?"*"”. 

A final filtering step was applied to remove genes with inconsistent 
expression patterns in any of the seven data sets. The stringent 
and multi-step filtering processes selected a reliable list of 278 genes 
co-activated by KIN10 and sugar starvation conditions, and co- 
repressed in sugar-treated seedlings or CO,-fixing adult leaves 
(Fig. 3a, and Supplementary Table 4). A second list of 322 genes 
was also identified on the basis of their co-repression by KIN10 
and sugar starvation conditions, but co-activation in sugar-treated 
seedlings or differential CO,-fixing adult leaves (Fig. 3b, and 
Supplementary Table 4). As predicted by the DIN gene response 
(Fig. 1), hypoxia-treated protoplasts also displayed gene expression 
profiles similar to KIN10 overexpression on the basis of ATH1 
GeneChip (GEO accession number GSE8248) and triplicated ana- 
lyses of selected marker genes by qRT-PCR (Supplementary Fig. 7a, 
b). The correlation across the data sets (Supplementary Table 5) was 
remarkable considering the diversity of materials and experimental 
conditions used for the comparison and filtering (see Supplementary 
Methods). This suggests that KIN10 regulation is a cell-autonomous 
and ubiquitous event and highlights its physiological relevance. 
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Figure 3 | Global gene expression regulation by KIN10. a, b, Transient KIN1O 
expression in protoplasts results in the induction (a) and repression (b) of 
genes involved in a wide variety of cellular processes and metabolism 
(Supplementary Fig. 7c—e, and Supplementary Tables 3, 4). The KIN10- 
mediated gene expression profile is positively correlated with that regulated 
by sugar and carbon deprivation, and negatively correlated with that 
controlled by sugars (see text for details). Ext., extended; sen., senescence; 
starv., starvation; suc., sucrose. 
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Previous studies have identified specific SnRK1 target genes, one 
each in potato and wheat*”. The present work provides for the first 
time a complete overview of the surprisingly broad genome-wide 
transcript changes induced by these conserved protein kinases in a 
multicellular organism. The most prominent KIN10-activated (and 
sugar-repressed) genes represented a variety of major catabolic path- 
ways, including cell wall, starch, sucrose, amino acid, lipid, and pro- 
tein degradation that provide alternative sources of energy and 
metabolites (Supplementary Fig. 7c, and Supplementary Tables 3, 
4). KIN10 also induced several APG/ATG (AUTOPHAGY) ortholo- 
gues and plant-specific trehalose metabolism genes ( TPS8-11), which 
potentially alter or sense the level of trehalose-6-P, a regulator of 
plant carbohydrate metabolism, growth and development”. 
Conversely, a large set of genes involved in energy-consuming ribo- 
some biogenesis (87 genes) and anabolism (Supplementary Fig. 7d, 
e) were co-ordinately repressed by KIN10 (Supplementary Tables 3, 
4). Although many plant metabolic pathways have been well char- 
acterized at the biochemical level, their molecular regulatory 
mechanisms are still poorly understood. Thus, besides the well- 
known roles of SnRK1 in phosphorylating and modulating enzymes 
important for carbon and nitrogen metabolism*'*”*”®, the present 
work identifies KIN10 and KIN11 as global regulators of gene 
expression involved in primary and secondary metabolism and pro- 
tein synthesis''"*!°°?”7, The results also reveal central and previously 
unrecognized regulatory roles of the plant SnRK1 (and possibly also 
mammalian AMPK) in controlling a large number of genes encoding 
transcription factors, chromatin remodelling factors, and a plethora 
of signal transduction components (Supplementary Fig. 7c, e, and 
Supplementary Tables 3, 4). 

Significantly, many KIN10 marker genes induced by multiple 
stresses are primary glucose repressible genes insensitive to cyclohex- 
imide’ and direct targets of GBF5 (Supplementary Fig. 7f). These 
genes are not induced by the inactive KIN10 K48M or T175A mutant 
(data not shown), pointing to a requirement of KIN10 protein kinase 
activity for this global gene expression response. Our results suggest 
that KIN10/11 initiate multiple transcription cascades in response 
to sugar or energy depletion in darkness and stress conditions. 
Coordinated transcription activation of direct target genes is at least 
in part exerted through GBF5 and related bZIP transcription factors. 
Future analyses will aim at elucidating the precise molecular link 
between the protein kinases and transcription factors. 

To study the long-term effects of KIN10 activity at the whole-plant 
level, we generated Arabidopsis transgenic plants overexpressing 
KIN10 (O1 and O2) or with reduced KIN10 expression using RNA 
interference (RNAi; 10-1 and 10-2) (Supplementary Fig. 8a). The 
KIN10 overexpression seedlings displayed some advantage in prim- 
ary root growth and development under low light with limited energy 
supply (Supplementary Fig. 8b, e). With exogenous sucrose, KIN10 
overexpression plants showed reduced growth in shoots and roots, 
possibly owing to KIN10 repression of biosynthetic activities 
(Supplementary Fig. 8c—e). In contrast, the KIN10 silenced lines were 
able to use exogenously supplied sucrose (Supplementary Fig. 8c—e) 
and glucose (data not shown) more efficiently and exhibited 
enhanced shoot and root growth. Growth on 3% sucrose also 
increased anthocyanin accumulation in wild-type, 10-1 and 10-2 
plants, but not in the KIN10 overexpression lines (Supplementary 
Fig. 8f). Consistently, KIN10 overexpression repressed the expression 
of MYB75/PAP1 (At1g56650) (Supplementary Table 3), a key tran- 
scription factor for anthocyanin biosynthesis (Supplementary 
Methods). 

A role of KIN10 in the starvation response was most evident after 
transferring seedlings to glucose-free liquid medium under a low 
light intensity that limits photosynthesis. Whereas wild-type seed- 
lings underwent rapid senescence on nutrient deprivation, KIN10 
overexpression promoted plant survival (Fig. 4a). When grown in 
soil, KIN10 overexpression altered inflorescence architecture (Fig. 4b, 
c) and delayed flowering and onset of senescence under long-day 
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conditions (20h light/4h dark)(Fig. 4c). This is reminiscent of life- 
span extension by calorie restriction in animals and provides evid- 
ence for a new role of KIN10 in determining plant shape and 
developmental transition timing and linking this to metabolism 
and environmental cues. Soil-grown 10-1 and 10-2 lines did not show 
an obvious phenotype, probably owing to functional redundancy of 
KIN11. 

To circumvent the experimental limitation of obtaining double 
loss-of-function mutants*'”'*, we generated kin11 single- and kin10 
kin11 double-mutant plants, using virus-induced gene silencing 
(VIGS)** of KIN11 in wild-type and 10-1 or 10-2 transgenic seedlings. 
Reduction of KIN transcripts was confirmed by RT-PCR and 
immunoblotting (using KIN10 and P-AMPK antibodies; Supple- 
mentary Fig. 9a) before extensive phenotypic and molecular studies. 
Three weeks after infiltration, double mutants d-1 and d-2 showed a 
surprisingly dramatic growth defect with small leaves and short 
petioles under 13h light/11h dark (Fig. 4d, and Supplementary 
Fig. 9b) or constant illumination conditions (data not shown). A 
strong accumulation of anthocyanins, curling of new leaves (leaves 
11-12) and symptoms of early senescence were visible (Fig. 4d). The 
impact on growth and senescence in d-1 and d-2 lines was most 
striking five weeks after infiltration, when the transition to the repro- 
ductive phase occurred in wild-type plants (Fig. 4e). The most 
severely silenced plants senesced before flowering (Fig. 4e), whereas 
those with milder silencing bolted but failed to produce viable 
primary inflorescences and axillary floral meristems (Fig. 4f). 
Unlike the snfla snf1b double mutant in moss”, the growth pheno- 
types could neither be rescued by continuous light irradiance nor 
by supplementation with 1% sucrose (data not shown). This 
argues against the growth defects being merely due to impaired cata- 
bolism, and supports a more fundamental and unexpected role of 
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Figure 4 | Role of KIN10 in plant development, stress responses and starch 
mobilization at night. a—c, KIN10 overexpression (O1, O2) prevents 
nutrient-deprivation-induced senescence (a), alters inflorescence 
development (b, c), and delays long-day-triggered senescence (c). d-f, kin10 
kin11 VIGS double mutant (d-1, d-2) plants show growth arrest (d) and 
premature senescence that precludes flowering (e), or bolting without 
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KIN10/KIN11 in normal vegetative and reproductive growth and 
development in flowering plants. 

To establish a definitive molecular and quantitative link between 
KIN10/11 action and the ability to mount transcription activation in 
response to stresses and energy deprivation, we examined the res- 
ponse to hypoxia, dark or DCMU in mesophyll cells and intact leaves 
of wild-type, single- and double-mutant plants. All KIN10 marker 
genes tested could no longer be activated by any of these stresses in 
the d-1 and d-2 double mutants (Fig. 4g, h; Supplementary Fig. 9c, d, 
and Supplementary Table 6). Consistent with a role of KIN10/11 in 
promoting catabolic processes, leaf starch—as a major carbon source 
in the absence of photosynthesis—remained high at the end of night 
in the d-1 and d-2 double mutants (Fig. 4i). This suggests that KIN10/ 
11 may have a previously unrecognized regulatory role in starch 
mobilization at night*®. Alternatively, starch accumulation may 
reflect a reduced energy demand for growth. In the future, it will 
be interesting and informative to extend the analysis of gene express- 
ion reprogramming to changes in enzyme activities and metabolite 
homeostasis in response to environmental cues associated with 
KIN10/KIN11 regulation. 

Our studies provide molecular, biochemical, genetic and genomic 
evidence for the essential functions of KIN10/KIN11 in plant pro- 
tection and survival under stress, darkness and sugar deprivation 
conditions. Surprising roles of KIN10/KIN11 in vegetative and 
reproductive growth and developmental transition under normal 
conditions are uncovered (Supplementary Fig. 1). Their roles as 
energy sensors and integrators, orchestrating global transcription, 
may also be found in mammals and thus our findings in plants 
may contribute to the understanding of AMPK functions in relation 
to diabetes, cancer, obesity and longevity. Further dissection of 
the components and molecular mechanisms of the KIN10/KIN11 
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signalling cascades will provide valuable information on the meta- 
bolic control of plant growth and development, which may enable a 
more targeted genetic modification of plant development, architec- 
ture, carbon allocation, and stress resistance—all major determinants 
of crop yield and renewable energy production. 


METHODS SUMMARY 

Primers. All primers used are listed in Supplementary Table 7. 

Effector and reporter constructs. KIN10, KIN11, ASK1, ASK2 and the full- 
length and truncated SOS2 (ASOS2, encoding amino acids 1-308) were fused 
to the haemagglutinin (HA) tag and cloned between a 35S-derived promoter and 
NOS terminator. The 624-bp promoter/5’-UTR sequence of DIN6 was fused to 
the LUC reporter gene to generate DIN6-LUC. 

Protoplast transient expression assay. Protoplast assays were performed using 
UBQI10-GUS as a transfection control. Protoplasts (1-4 x 10*) were incubated 
for 6h in 1 ml of mannitol buffer in 6-well plates, or submerged in 1 ml of buffer 
in a 1.5 ml microfuge tube for hypoxia treatment. Protein expression was ana- 
lysed using KIN10-specific rabbit antibody generated against a glutathione 
S-transferase (GST)-fused KIN10 polypeptide (amino acids 335-388), human 
anti-P-AMPK (New England Biolabs), and a monoclonal anti-HA antibody 
(Roche). For dark treatment, plates were covered with aluminium foil. DCMU 
was added at 201M. For global gene expression analyses, transfection experi- 
ments were scaled up 50-fold. For details on GeneChip experiments, data pro- 
cessing and analysis, see Supplementary Information. 

Protein kinase assays. These were performed using immunoprecipitated HA- 
tagged kinases and SPS peptide (RDHMPRIRSEMQIWSED) as a substrate. 
Transgenics. To generate KIN10 overexpression lines, the coding region was 
cloned in a pCB302-derived minibinary expression vector. For KIN10 RNAi, the 
last 238 bp of the coding region (primers: 5'-cgggatcccagcagcgcagatggtatg-3’, 5’- 
aaggcctcatgccatggtcagaggactcggagctgag-3’) were cloned in both orientations, 
separated by an intron. 

Virus-induced gene silencing (VIGS). KIN11 VIGS was performed as 
described** using a gene-specific 503-bp KIN11 fragment (—103 to +400, pri- 
mers: 5’-ggaattcgtttctgtatatttcttcgctc-3', 5’-cgggatccagtactctacaccagatattat-3’) 
and repeated six times with consistent results. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Phylogenetic tree. For generation of the phylogenetic tree (Supplementary Fig. 
3a), protein sequences were aligned with ClustalW, using the SDSC Workbench 
(workbench.sdsc.edu). 

Effector and reporter constructs. Effector constructs—KIN10 (SnRK1.1; 
At3g01090), KINII (SnRK1.2; At3g29160), and SOS2  (SnRK3.11; 
At5g35410)—were amplified from leaf complementary DNA, fused to the HA 
tag, and cloned between a 35S-derived promoter and NOS terminator **~*’. The 
ASK1 (SnRK2.4; Atlg10940) and ASK2 (SnRK2.1; At5g08590), and the consti- 
tutively active version of SOS2 (ASOS2, amino acids 1-308) were as 
described*'***°, Reporter construct: The 624-bp promoter/5'-UTR sequence 
of DING (At3g47340) was fused to the LUC reporter gene to generate DIN6- 
LUC as described*****. The kinase (K48M, K49M, T175A) and DIN6 promoter 
mutants were generated by site-directed mutagenesis, as described’. 
Protoplast transient expression assay and treatments. Protoplast transient 
expression assays were carried out as described****”’, using 35S-GUS or 
UBQ10-GUS reporter gene fusions as transfection controls***’. Data were gen- 
erated from at least three independent experiments with consistent results. 
Protoplasts (1-4 X 10*) were incubated for 6h in 1 ml of mannitol buffer in 
6-well tissue culture plates (1mm depth, excluding hypoxia effects)*****” or 
submerged in 1 ml of mannitol buffer in a 1.5 ml microfuge tube (25-mm depth) 
for hypoxia treatment. Establishment of the hypoxic condition was confirmed by 
the induction of well-known hypoxia marker genes encoding ADH and PDC 
(refs. 38, 39) in qRT-PCR and microarray analyses (data not shown). These 
genes are induced under hypoxic and anoxic conditions, but not under other 
types of cellular energy stress. Unlike DIN1 and DIN6, their induction cannot be 
repressed by sugar’. For dark treatment, plates were covered with aluminium 
foil for 6h. If not otherwise indicated, DCMU, glucose and sucrose, were added 
to final concentrations of 20 11M, 25 mM and 50 mM, respectively. 

RT-PCR analysis. The fifth and sixth leaves of 4-week-old plants, protoplasts 
and one-week-old seedlings were treated with darkness (leaves and protoplasts, 
6h; leaves of VIGS plants, 10h; seedlings, 20h), DCMU (leaves and seedlings, 
100 uM, 20h; leaves of VIGS plants, 100 1M, 10h; protoplasts, 50 11M, 6h) or 
submerged in water (hypoxia treatment, leaves and seedlings, 20 h; protoplasts, 
6h) in the absence or presence of 3% sucrose, 25 mM glucose, or a protein kinase 
inhibitor K252a (2 4M). RNA samples were isolated and analysed by RT-PCR 
for DIN1 and DIN6 expression using gene-specific primers (Supplementary 
Table 7). UBQ4 was used as a control gene. Total RNA (1 jg) was converted 
to single-stranded complementary DNA by reverse transcriptase (Invitrogen). 
Quantitative real time RT-PCR (qRT-PCR) was performed using PCR Master 
Mix SYBR Green (BioRad) in a BioRad iCycler (3 min 95 °C, 40 cycles at 10s 
95°C, 45 s at 59°C, and 1 min 95°C, 1 min 55°C). In the protoplast assays, marker 
gene expression was normalized to CIPK23 (At1g30270) expression, chosen on 
the basis of its steady levels in the KIN10 and hypoxia GeneChip experiments 
(Supplementary Table 1). In the leaf assays, marker gene expression was normal- 
ized to TUB4 (At5g44340). 

Protein kinase assays. Kinase assays were performed as described” using HA- 
tagged kinases immunoprecipitated from protoplasts and the SPS peptide 
(RDHMPRIRSEMQIWSED) as a specific substrate*!. Kinase reaction mixes 
were spotted onto Whatman P81 phosphocellulose paper squares and the reac- 
tions were terminated by immersion in 75 mM phosphoric acid. Squares were 
washed twice, airdried and incorporated radioactivity was measured by liquid 
scintillation counting. 

Antibodies and protein expression analyses. To generate the KIN10-specific 
antibody, a GST-fused KIN10 polypeptide (amino acids 335-388) was produced 
in E. coli using a pGEX-4T-1 vector and GST was removed before rabbit injec- 
tion. The antibody was affinity purified using a SulfoLink matrix (Pierce). 
Phospho-KIN10 and phospho-KIN11 were detected using a commercial anti- 
body against the human phospho-AMPKa, subunit (P-AMPK, New England 
Biolabs). HA-tagged proteins were detected by an anti- HA monoclonal antibody 
(Roche) using standard techniques”. 

Transgenics. To generate transgenic KIN10 overexpression lines, the coding 
region was cloned into an expression vector derived from the pCB302 minibin- 
ary vector*’. For generating the kin10 RNAi lines, the last 238 bp of the coding 
region was cloned into pQH29 in both orientations separated by an intron (Q. 
Hall, unpublished information) and sub-cloned into the pCB302-derived 
expression vector*’. Multiple transgenic lines were generated by standard pro- 
tocols**, screened by RT-PCR and analysed for consistent phenotypes. 
Loss-of-function transgenics. No T-DNA insertion, enhancer or gene trap 
knockout lines with significantly reduced KIN10 or KIN11 expression could 


nature 


be obtained (data not shown). We also attempted to generate kin11 single and 
kin10 kin11 double mutants using the same RNAi strategy. However, despite 
multiple efforts and extensive transgenic screens, we were not able to recover 
transgenic lines, possibly because of embryonic lethality*’’. Studies in barley 
pollen” and pea seed'* have shown that antisense expression of SnRK1 causes 
male sterility and embryo maturation defects, consistent with our inability to 
obtain transgenic Arabidopsis lines with deficiency in both KIN10 and KIN11. 
Virus-induced gene silencing (VIGS). VIGS was performed as described using 
Arabidopsis wild-type, KIN10-overexpressing, and kin10 RNAi seedlings*?™. 
The control VIGS construct was generated using a 596-bp GFP fragment and the 
KIN11 VIGS construct, using a gene-specific 503-bp fragment (103 bp of the 
5’ UTRand the first 400 bp of the KIN11 coding region). The mutant plants were 
identified by RT-PCR and the VIGS experiments were repeated six times with 
consistent results. 

Plant growth and phenotype assays. Plants were grown in soil under standard 
13h light/11h dark or 20h light/4 h dark (long-day) conditions. For the various 
phenotype assays, seedlings were grown under constant light (80 WE) in liquid 
medium: (1) 0.5X MS, 0.5% sucrose, 60 r.p.m. for 7 days for dark, DCMU, and 
hypoxia treatments; (2) MS, 2% glucose, 60 r.p.m. for 7 days and transferred to 
glucose-free medium for 2 weeks, for nutrient starvation assays; (3) MS, 3% 
sucrose, 60 r.p.m. for 7 days for spectrophotometric anthocyanin quantification 
(ODs39). To test the effect of sugar availability on growth, seedlings were grown 
for 11d vertically on agar plates (0.1 MS = sucrose) under 16 h light/8 h dark 
conditions. 

Starch quantification. Starch was extracted from 0.2-0.3 g (fresh weight) of leaf 
material and quantified using a starch assay kit (Sigma). 

URLs. Genevestigator is available at http://www.genevestigator.ethz.ch/; 
MAPMAN, at_ http://gabi.rzpd.de/projects/MapMan/; NASC_ International 
Affymetrix Service at http://affymetrix.arabidopsis.info/; and TIGR MeV at 
http://www.tm4.org/mev.html. Gene annotation information is available on 
the Sheen lab website at http://genetics.mgh.harvard.edu/sheenweb/search_ 
affy.html. 
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The CDKN2b-CDKN2a locus on chromosome 9p21 in human 
(chromosome 4 in mouse) is frequently lost in cancer. The locus 
encodes three cell cycle inhibitory proteins: p15'\“*° encoded by 
CDKN2b, p16'\“** encoded by CDKN2a and p14*"* (p194"* in 
mice) encoded by an alternative reading frame of CDKN2a 
(ref. 1). Whereas the tumour suppressor functions for p16'N““* 
and p144"* have been firmly established, the role of p15'\**° 
remains ambiguous. However, many 9p21 deletions also remove 
CDKN2b, so we hypothesized a synergistic effect of the combined 
deficiency for p15'N**?, p144®* and p16"\**. Here we report that 
mice deficient for all three open reading frames (Cdkn2ab-/~) are 
more tumour-prone and develop a wider spectrum of tumours 
than Cdkn2a mutant mice, with a preponderance of skin tumours 
and soft tissue sarcomas (for example, mesothelioma) frequently 
composed of mixed cell types and often showing biphasic differ- 
entiation. Cdkn2ab-’/— mouse embryonic fibroblasts (MEFs) are 
substantially more sensitive to oncogenic transformation than 
Cdkn2a mutant MEFs. Under conditions of stress, p15‘ protein 
levels are significantly elevated in MEFs deficient for p16""“*. Our 
data indicate that p15'"*® can fulfil a critical backup function for 
pl6'™““* and provide an explanation for the frequent loss of the 
complete CDKN2b—CDKN2a locus in human tumours. 

Both p15“ and p16'N““ are able to induce cell cycle arrest in G1 
by inhibiting cyclin-dependent kinases CDK4 and CDK6 to inactivate 
the retinoblastoma (RB1) family of tumour suppressor proteins’. The 
unrelated p14“"" protein acts primarily by way of MDM2 to activate 
the key checkpoint protein TRP53, thereby inducing either cell cycle 
arrest (both in G1 and in G2) or apoptosis’. The entire locus is fre- 
quently lost or mutated in a wide range of primary tumours*®. 
Mutational analyses and studies with mouse models indicate that 
both p16“ and p144"* function as tumour suppressors*'°. For 
mae the situation is less clear, because no point mutations affect- 
ing its function and only a few cases of epigenetic silencing have been 
found*"*, Moreover, mice deficient for p15'™*” show only a very subtle 
tumour predisposition’®. The frequent loss of cyclin-dependent inhib- 
itor locus 2 (CDKN2b) in human tumours could be explained by its 
juxtaposition to CDKN2a leading to frequent co-deletion, although 
combined loss of p16“ and p15'N*® is associated with a significant 
reduction in survival of patients with soft tissue sarcomas’. 

To study the effect of additional p15'™*® loss on tumorigenesis, 
we have generated two new mouse strains, one in which both 
pis? and pl6'™* are inactivated (which we call Ink4ab~/~), 
and one deficient for p15'™*®, p16'"*** and p19“ (which we call 
Cdkn2ab-’~). The Ink4ab- and Cdkn2ab- alleles are schematically 
depicted in Fig. la. For more details see Methods and Supplemen- 
tary Fig. 1. 


Within seven months, all Cdkn2ab~/~ animals develop one or 
more tumours (Fig. 1b). The predominant lesions are skin tumours, 
including squamous cell carcinoma, basal cell carcinoma and tricho- 
epithelioma/hair-germ-cell tumours, and various soft tissue sarco- 
mas (Fig. 1c and Fig. 2). Both types of lesions are typically composed 
of different cell types, possibly reflecting oncogenic transformation of 
an early progenitor, and frequently show biphasic morphology. For a 
detailed description and histological characterization of both types of 
lesions, see Supplementary Data. Several animals developed lym- 
phoma or leukaemia. In addition, a few cases of (adeno)carcinoma 
of the intestinal tract and one case of spontaneous melanoma (onset 
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Figure 1| Generation of Ink4ab-’~ and Cdkn2ab~’~ mice. a, Schematic 
representation of the Ink4ab ’~ and Cdkn2ab’/~ mutant alleles. hyg, 
hygromycin cassette; red triangle symbol, loxP site; TAG, p16'"““*-specific 
point mutation. b, Kaplan—Meier cumulative survival curves of Cdkn2ab /~ 
(homozygous, Hom; n = 55), Cdkn2ab’* (heterozygous, Het; n = 57) and 
Cdkn2ab*’* (wild type, WT; n = 27) littermates. P < 0.001 for every pair. 
c, Tumour spectrum of Cdkn2ab~‘~ mice. d, Kaplan-Meier cumulative 
survival curves of DMBA-treated mice (dashed lines represent + DMBA). 
Blue, Ink4a‘~ (n = 15 treated, n = 23 untreated); red, Ink4ab-/~ (n = 27 
treated, n = 27 untreated). P< 0.001 for treated versus untreated animals, 
and for treated Ink4ab /~ versus treated Ink4a‘~ animals. 
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at three weeks of age) were observed. Except for skin tumours, 
Cdkn2ab*’~ heterozygous animals develop similar tumours to 
homozygous animals although they have a much longer latency 
(Fig. 1b). Ink4ab-/~ animals that are p19“"-proficient do not show 
a significant tumour predisposition within the first year (Fig. 1d). 
However, application of a single dose of dimethylbenzantrene 
(DMBA) to newborn Ink4ab”’~ pups induces the same spectrum 
of tumours with the same histological features as observed for 
Cdkn2ab"/~ mice, including skin tumours. Notably, DMBA- 
induced tumours appear much earlier and also show a much broader 
diversity in Ink4ab ’~ mice than in mice deficient in p16""“* (which 
we call Ink4a~‘~ mice), indicating that p15“? has a critical tumour 
suppressor activity (Fig. 1d). 

To analyse the effects of different Cdkn2b—Cdkn2a mutations on cell 
behaviour, MEFs from various strains were characterized using a 3T3 
protocol. For a summary of the different lines used throughout this 
study, see Supplementary Fig. 2. As expected, p19“"-deficient MEFs 
grow faster than p19“"-proficient MEFs. Combined loss of p15'"“*> 
and p16", in both p19“"-proficient and -deficient cells, confers a 


Figure 2 | Cdkn2ab~’~ mice develop skin tumours and sarcomatous 
lesions. a, A Cdkn2ab ‘~ mouse with multiple skin lesions. b, H&E staining 
(X20) of a well-differentiated squamous cell carcinoma. ¢, A detailed image 
(H&E, 40) of the lesion shown in b, showing basaloid and hair-germ-cell- 
like differentiation. d, A detailed image (H&E, X40) showing extensive hair 
germ-like differentiation. e, Mesothelioma-like lesion in the lung, pleura and 
mediastinum (H&E, X2.5). L, lung; the black arrowhead indicates the 
oesophagus. f, Mesothelioma-like lesion in the thoracic wall (H&E, <2.5). 
The white arrowheads indicate the ribs. g, Synovial sarcoma-like lesion 
surrounding the knee joint (H&E, X2.5). The black arrowhead indicates the 
cross ligament. h, Lesion surrounding the vertebral column (H&E, <5). The 
asterisks represent tumours. 
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slightly increased growth potential and a shorter cell cycle re-entry 
latency compared to loss of either p15'™*” or p16" alone (Fig. 3a,b). 

Soft agar assays showed that none of the cell lines were capable of 
anchorage-independent growth. After retroviral transduction of 
oncogenic RAS (HRAS(G12V)), only p194"-deficient MEFs formed 
colonies in soft agar (Fig. 4a). Notably, HRAS(G12V) transformation 
of Cdkn2ab-‘~ cells resulted in about tenfold more colonies than 
transformation of either Arf_ or Ink4a/Arf~ “~ cells. Moreover, both 
under normal culture conditions and in soft agar, the HRAS(G12V)- 
transformed Cdkn2ab‘~ cells grew much faster than either Arf ’~ or 
Ink4a/Arf ‘~ cells, with the first soft agar colonies apparent after 
three days. This striking difference cannot be explained by large varia- 
tions in expression levels of RAS, Cdk6 or phosphorylated (P)-Erk 
(Supplementary Fig. 5). Transduction of c-MYCalso increased colony 
growth in soft agar, although the colonies were smaller than the 
HRAS(G12V)-transformed colonies. In contrast, transformation by 
both c-MYC and HRAS(G12V) resulted in roughly equal numbers of 
soft agar colonies from Arf ’~, Ink4a/Arf ‘~ and Cdkn2ab~‘~ cells. 
These in vitro assays suggest that, in Ink4a/Arf /” MEFs, p15'™*> 
suppresses oncogenic transformation. 

To determine how p15'"*? and p16'"““* expression is correlated, 
we analysed the expression of p15'"“*? and p16" under different 
conditions. In proliferating cells, p16'"“* expression is initially very 
low and increases in subsequent passages. Expression of p15'"“*? is 
already significant at passage two, and remains stable throughout the 
culture process (Fig. 3e). In contrast, on oncogene-induced express- 
ion of p16'™**, p15'"*® expression is lost (Fig. 4d). The same cor- 
relation between p16" and p15'"“*” expression levels was observed 
in super-confluent cells (Supplementary Fig. 4a). 

Whereas p16'"““* transcription was induced on consecutive pas- 
saging or introduction of HRAS(G12V) (Fig. 3d, 4c and Supplemen- 
tary Fig. 4b), levels of messenger RNA encoding p15'"“*? remained 
stable under these conditions (Figs 3c, d, 4c and Supplementary 
Fig.4b). Thus, p16'™“** protein levels reflect RNA levels, whereas 
piso protein levels do not. Notably, treatment of HRAS(G12V)- 
transformed cells with the proteasome inhibitor MG132 causes a 2.6- 
fold increase in p15" protein levels in the presence of p16'™*, 
indicating that p15'“*® is subject to proteasomal degradation. In 
contrast, MG132 treatment did not increase p15'"“* protein levels 
in p16'"“**-deficient cells (Fig. 4e). Small variations in Cdkn2d 
(which encodes p19'"“**) mRNA and protein levels were observed, 
but these did not correlate with either genotype or passage number. 
However, in the absence of both p16" and p15'"™®, slightly higher 
levels of Cdkn2c (which encodes pis 4 mRNA and protein are 
found (Supplementary Fig. 4c, d). 

Binding of p15'"“? to Cdk4 was increased in the absence of 
pie, both in normal cells (Fig. 3f) and in HRAS(G12V)- 
transformed cells (Fig. 4f). Binding of p18'™““* to Cdk4 remained 
unchanged in the different lines and under the various conditions. 
This shows that p15'™*? is able to functionally substitute for 
pl6™***, and suggests a unique relationship, at least in MEFs, 
between p16“ and p15'™*”, 

Because RAS transformation is dependent on Cdk4 (ref. 16), we 
speculated that co-expression of an Ink4-insensitive CDK4 mutant 
(CDK4(R24C)) would eliminate the differences observed between the 
p19*"-deficient strains (Fig. 4b). Introduction of both CDK4(R24C) 
and HRAS(G12V) into MEFs increased the number of Ink4a/Arf~ i= 
and Arf ‘~ colonies more than 7.5-fold, but had hardly any effect on 
the number of Cdkn2ab~’~ colonies. We also noted that, in soft agar 
assays, the Ink4a/Ar, ‘~ MEFs were less sensitive to transformation 
than Arf “ MEFs. This could reflect differences in genetic back- 
ground and/or residual p19*" activity from the remnant exon 1f in 
the Ink4a/Arf ‘~ cells that becomes apparent under the experimental 
conditions used (for example, high levels of oncogenic RAS)’’. Also, 
we cannot exclude that p15'"“*” has functions that are not shared 
with p16“, However, our data indicate that both p16“ and 
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Figure 3 | MEF analysis. a, Proliferation curves (+s.d., n = 3). P, passage 
number. b, Cell cycle re-entry curves. Horizontal axis shows number of 
hours after the addition of serum. c, Relative mRNA levels (Taqman; +s.d., 
n= 4). Levels in wild-type (WT) cells at P3 were set at 1. d, Relative mRNA 
levels in p194"-deficient cells (Taqman; +s.d., n = 4). Levels at P3 were set at 


p1s'™*> suppress HRAS(G12V) oncogenic transformation primarily 
by means of inhibition of Cdké4 signalling. 

Taken together, our results show that loss of the complete Cdkn2b-— 
Cdkn2alocus confers a very high tumour predisposition. Mice in which 
p19"" is knocked out are primarily predisposed to fibrosarcoma and 
lymphoma after a relatively long latency'*"'. Additional loss of p16“ 


1. Asterisk, levels of mRNA encoding p16"™** were measured using a probe 


measuring both p16" and p19". e, Relative protein levels in proliferating 
cells. Passage numbers are indicated. f, Cdk4 binding. IP, anti-Cdk4- 
precipitated Ink4a/Arf ‘~ and Arf ‘~ extracts. 


loss of the second inhibitor of Cdks 4 and 6 encoded by this locus results 
in a pronounced predisposition to the development of a wide range of 
highly malignant tumours not seen in either Arf ’~ or Ink4a/Arf ‘~ 
mice. Notably, after DMBA treatment, Ink4ab~’~ mice show a much 
higher tumour predisposition than either Ink4a ’~ (Fig. 1d) or 
~ mice!’, indicating that the combined loss of p15'“*® and 


results in a closely resembling tumour phenotype”!®. Here we show that _ p16'"“** augments tumour predisposition independent of the p19“ 
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Figure 4 | Analysis of oncogenically transformed MEFs. a, Soft agar 
analysis of c- MYC (MYC)- and HRAS(G12V) (RAS)-transformed cells. 
Images show HRAS(G12V)-transformed cells after ten days of culture 
(+s.d., n = 3). b, Soft agar analysis of Cdk4(R24C)- (called R24C in image) 
and HRAS(G12V) (RAS)-transformed cells. The fold increase in the number 
of colonies in doubly transformed cells as compared to in the HRAS(G12V) 


single transformation is indicated below the graph (+s.d., n = 2). ¢, Relative 
mRNA levels (Taqman; +s.d., n = 4). Levels in mock-transformed (C) cells 
were set at 1. Asterisk, the levels of p16""““* were obtained using a probe 
measuring both p16" and p19*". d, Relative protein expression levels. 
e, Relative protein levels after MG132 treatment. The length of treatment is 
indicated (hours). f, Cdk4 binding in HRAS(G12V)-transformed cells. 
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status. The strong effect of co-deletion of p15'"“*” and p16'™* could be 
explained by attributing distinct complementing functions to these two 
proteins. We consider this to be unlikely for the following reasons. First, 
hardly any INK4b mutations not affecting INK4a have been reported. 
Second, p15'"* knockout mice have a weak tumour phenotype and 
p15'™*>_deficient MEFs show only a slight increase in sensitivity to 
transformation by oncogenic RAS (ref. 15). Third, our in vitro data 
show that p15'™** protein is degraded when p16'" is expressed. Last, 
binding of p15'"*? to Cdk4 is increased in the absence of p16'"“. 
Therefore, our results are best explained by a model in which 
p1s'™**? fulfils a ‘backup’ role for p16". 

It has been suggested that p16'“*** is the most effective inhibitor of 
CDKs 4 and 6, because only p16'*** can form stable, binary com- 
plexes with both CDK4 and CDK6 (ref. 19). However, p16"“** is 
expressed at very low levels under normal physiological conditions. 
Under conditions of stress (including oncogenic stress, ageing and 
growth inhibition), the expression of pl6'™“** increases. Should 
p1s'™*> be expressed, p16'"“** will efficiently compete for Cdk4 
(and probably Cdk6) binding, resulting in the displacement and 
degradation of the p15'“*® protein. Many cells express p15'"“*> 
either constitutively or on induction by factors such as transforming 
growth factor-B. On loss of p16'"““*, Cdk4 inhibition is maintained 
by p15'"“*®. In p194"-deficient MEFs, the loss of both p16'"““* and 
p1s'™“*> only marginally affects their growth properties, but greatly 
enhances their propensity to become transformed by oncogenic RAS. 
This might relate to the earlier reported ability of p16'““** to inhibit 
integrin-mediated cell spreading’, which could explain the early 
onset and aggressive growth of tumours in Cdkn2ab‘~ mice. 
According to our model, efficient malignant transformation would 
require the inactivation of both p16'“** and p15'“*®, thus provid- 
ing a rationale as to why the entire CDKN2a—CDKN2b locus is so 
frequently deleted. Although the other family members p18’“““« and 
p19'N*4 could have a role similar to that of p15'“*, there are no 
reports showing the combined loss of p16'\“* and either of these 
proteins. The duplication of the gene that encodes p15'N“*», which is 
considered the ancestral Ink4 inhibitor in this locus, might have 
evolved in mammals to serve a dual function’. As a result, p15'N“* 
and p16'\“* now respond to different signalling pathways, although 
acting mostly on the same target proteins. At the same time, p15'N**> 
serves as a backup of pl6"“**, thereby ensuring a more robust 
tumour suppressive capacity of this important locus. Even so, this 
remains a hazardous situation, because a single deletion could result 
in the elimination of three critical tumour suppressor activities. 


METHODS SUMMARY 


The Ink4ab~‘~ strain was obtained through Ink4b targeting in Ink4a* ES cells (in 
which there is a point mutation in exon 2 of Cdkn2a that is silent for the p19" 
reading frame but results in a stop codon in the p16'™“ reading frame)'’. The 
Cdkn2ab~/~ strain was obtained through subsequent crossing of the Ink4ab~/~ 
strain with the actin-Cre deleter strain. For tumorigenic studies, mice were killed 
when seriously ill or when visible tumours reached a diameter of 1 cm. DMBA 
treatment consisted of the application ofa single dose of DMBA (50 tl ofa solution 
of 0.5% DMBA in acetone) to the dorsal skin of newborn pups (0-3 days old). 

Tissues were fixed in either 4% formaline or acidified formalin (ethanol/acetic 
acid/formaldehyde/saline at 40:5:10:45 v/v)?!. Haematoxylin and eosin (H&E) 
stainings were performed according to standard procedures. A full list of all the 
antibodies used can be found in the Methods. 

MEFs from E13.5 embryos were isolated and cultivated according to standard 
protocols”. Cell cycle re-entry was determined after 72 h of serum starvation (0.5% 
FCS). Oncogenic transformations were performed according to standard protocols 
using the following viruses: pLZRS-ires-gfp-hygro, pLZRS-cMYC-ires-gfp, 
pLZRS-CDK4(R24C)-ires-gfp, pBabePuro and pBabePuro-HRAS(G12V). 

The immunoprecipitations were performed essentially as described in ref. 23, 
and all immunocomplexes were visualized by enhanced chemiluminescence 
(Pierce 34080). 

RNA expression analysis was performed using both Taqman and SYBR Green 
quantitative PCR methods. 
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FACS analysis was performed according to standard protocols using a 
FACSCalibur flowcytometer (Becton Dickinson) and CellQuest software 
(Becton Dickinson) for data analysis. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Mice. For generation of Ink4ab- and Cdkn2ab mouse strains, see Supplementary 
Fig. 1. The Ink4a* ES cell clone’, containing a PGK-neo selection cassette flanked 
by loxP sites 1 kbp upstream of exon 2, was used as starting material. In the 
pia targeting vector, Cdkn2b exons 1 and 2 were replaced by a PGK-hyg 
selection cassette (orientation indicated in Supplementary Fig. 1a) with a5’ loxP 
site in the same orientation as the loxP sites flanking the PGK-neo cassette 
upstream of exon 2 of Cdkn2a. Correctly targeted clones were selected by 
Southern blotting using an Xbal/BamHI probe 9 kb downstream of exon 2 of 
Cdkn2b. Subsequently, each of these clones was transfected transiently with a Cre 
recombinase plasmid to identify the clones in which the genes encoding pisink> 
and p16'"“"* were disrupted on the same allele. When the modifications are on 
different alleles, Cre-mediated deletion will result in the removal of only the 
PGK-neo cassette. In contrast, in clones containing a doubly targeted Cdkn2b- 
Cdkn2a allele, Cre activity can result in three distinct deletion products depend- 
ing on the specific set of loxP sites involved. One parental clone gave separate 
subclones in which all three of these Cre-mediated deletions were detected. From 
this parental clone, subclones in which only the PGK-neo cassette had been 
deleted (leaving one loxP site 3’ of Cdkn2b exon 2 and a second loxP site 5’ of 
Cdkn2a exon 2) were used for blastocyst injection. F; offspring were either used 
to derive a (Ink4ab ‘~) colony or were crossed with actin-Cre mice™ to generate 
germline deletion of the entire Cdkn2a—Cdkn2b locus (Cdkn2ab-’~ colony). 
Both colonies were crossed to an FVB/N background. 

For an overview of all strains used throughout this study, please refer to 

Supplementary Fig. 2. Ink4a” and Ink4a/Arf were published previously as 
Ink4a* and Ink4a“?°, respectively’. Ink4b” was generated in our laboratory 
(unpublished) and carries the same mutation as the published Ink4b knockout 
strain'’. The Arf strain was obtained from the laboratory of C. Sherr'®. The 
second Cdkn2b—Cdkn2a knockout strain briefly referred to in the Supplemen- 
tary Data is depicted as Cdkn2ab (A) in the Supplementary Fig. 2. This strain 
gave similar results as the Cdkn2ab’ (B) strain reported in this study. 
Histological analysis. Primary antibodies were obtained from the following 
sources: E-cadherin, Transduction Laboratories 610182; involucrin, Babco 
PRB-140C; K1, Babco PRB-165P; K8, Developmental Studies Hybridoma 
Bank Troma-1; K10, Sigma C7284; K14, Babco PRB-155P; loricrin, Babco 
PRB-145P; and vimentin, Research Diagnostics RDI-PROGP-53. The mesothe- 
lin antibody was a gift from T. Bera. Antibody staining was revealed using either 
diaminobenzidine or 3-amino-9-ethylcarbazole chromogen. Sections were 
counterstained with haematoxylin. Images were captured using a Zeiss 
Axioskop2 Plus microscope (Carl Zeiss Microscopy) equipped with a Zeiss 
AxioCam HRc digital camera, and processed using AxioVision 4 software 
(Carl Zeiss Vision). 
Cell culture. Cells were grown at 37 °C, 5% CO2, in DMEM supplemented with 
10% FCS, 100 units per ml penicillin, 100 ppgml" streptomycin and 0.1mM 
B-mercapto-ethanol. Cells were serially passaged twice a week according to the 
3T3 protocol. Plates were scraped and pellets were stored at —80 °C for future 
protein and RNA analysis (see below). 

For cell cycle re-entry experiments, cells (passage 2) were serum-starved (0.5% 
FCS) for 72 h. One hour before analysis, BrdU was added to the medium (10 UM 
final concentration). Cells were collected by trypsinisation, washed, fixed in 70% 
ethanol and stored at 4°C until further analysis. 

For soft agar growth analysis, cells (24,000 cells per well in a 6-well plate) were 
resuspended in 0.4% soft agar and plated on a bottom layer of 1% agar. The 
number of colonies growing in soft agar after two weeks of culture was quantified 
using ImagePro Plus 5.1.2 software (MediaCybernetics). 

For oncogenic transformation, passage two MEFs were infected overnight 
with pLZRS-ires-gfp-hygro control virus, with pLZRS-cMYC-ires-gfp virus 
expressing human c-MYC or with pLZRS-CDK4(R24C)-ires-gfp expressing 
the non-INK4 binding mutant of CDK4. After recovery during the day, the cells 
were infected overnight with either pBabePuro control virus or pBabePuro- 
HRAS(G12V) virus expressing human HRAS(G12V). Puromycin selection 
(4 ug ml) was started 24h after the second infection and maintained for one 
week. Viruses were provided by D. Peeper and R. van Amerongen. 
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To inhibit proteasome-mediated protein degradation, HRAS(G12V)- 

transformed Arf’ and Ink4a/Arf ‘cells were treated with either 201M 
MG132 or ethanol for 4, 6 or 10h. 
Protein expression analysis. MEF pellets were lysed in RIPA buffer and the 
lysates were cleared by centrifugation at 20,800g for 10 min. Proteins were 
separated on 4—12% NuPAGE Novex Bis-Tris gels (Invitrogen), transferred to 
nitrocellulose (Schleicher and Schuel) or Immobilon (Millipore) membrane 
and probed with antibodies. Immuno-complexes were visualized by enhanced 
chemiluminescence (Pierce 34080) using goat anti-mouse (AMI3404, 
Biosource) or goat anti-rabbit (ALI0404, Biosource) antibodies coupled to 
horseradish peroxidase. 

Cells were harvested by scraping and were incubated in lysis buffer (50 mM 
HEPES, pH7.5, 100mM NaCl, 2mM EDTA, 0.5% NP-40, 10% glycerol, 
0.1mM Na3VOq, 0.5mM Naf, 0.1 mM PMSF, 2.5 pg of leupeptin per ml and 
1mM DTT) for 30min on ice. After overnight incubation with anti-Cdk4 at 
4°C, the immune complexes were recovered with protein G-agarose beads (1 to 
2h, 4°C) and washed twice with lysis buffer. The immune complexes were 
resolved on a 10% SDS-PAGE gel and electrophoretically transferred to nitro- 
cellulose membranes. Membranes were blocked in PBS containing 0.1% Tween 
20 and 5% instant milk, and were incubated with antibody for 2h at room 
temperature (18-21 °C) in PBS containing 0.1% Tween 20. Proteins recognized 
by the antibody were detected by enhanced chemiluminescence using a horse- 
radish peroxidase-coupled secondary antibody (Pierce). 

The primary antibodies used were: p16'™*, M-156 (Santa Cruz); p15!">, 
K-18 (Santa Cruz); pisinkts, M-168 (Santa Cruz); piginktd, M-167 (Santa Cruz); 
p19", R562 (Abcam); o-tubulin, B512 (Sigma); RAS, 3965 (Cell Signaling); 
P-Erk, 9182 (Cell Signaling); Cdk4, sc-260 (Santa Cruz); and Cdk6, 3136 (Cell 
Signaling). 

RNA expression analysis. RNA was isolated from frozen MEF pellets using 
RNEasy columns (Qiagen). For each sample, 1 ug mRNA was reverse transcribed 
using random hexamers (Roche, first Strand cDNA synthesis kit for RT-PCR). 
Samples were analysed at least in triplicate on a minimum of two different plates. 
Reactions were performed using the ABI Prism 7000 sequence detection system. 
Taqman quantitative PCR analysis. Reactions were performed according to 
manufacturer’s instructions using Taqman Universal PCR Master Mix 
(Roche) and the equivalent of 10ng of RNA per reaction. The following 
probe/primer mixes were used: Cdkn2b (pis) Mm00483241_m1; Cdkn2a 
(pl6™* + p94") _Mm00494449_m1; Cdkn2a (p19) Mm01257348_m]; 
Cdkn2c (p18'™*) Mm01258411_g1; Cdkn2d (p19'"™*) Mm00486943_m1; 
and Gapdh internal control (Mm99999915_g1). 

SYBR Green quantitative PCR analysis. Reactions were performed according to 
manufacturer’s instructions using Power SYBR Green PCR Master Mix (ABI) 
and the equivalent of 50 ng of RNA per reaction. The following primers were 
used: p15'™®, AGATCCCAACGCCCTGAAC and CCCATCATCATGACC- 
TGGATT; p16™*, CGTACCCCGATTCAGGTGAT and TTGAGCAGAAGA- 
GCTGCTACGT; p19", GCCGCACCGGAATCCT and TTGAGCAGAAGA- 
GCTGCTACGT; and Hprt internal control, CTGGTGAAAAGGACCTCTCG 
and TGAAGTACTCATTATAGTCAAGGGCA. 

FACS analysis. BrdU labelling and propidium iodide staining were performed 
according to standard protocols. Briefly, ethanol-fixed cells were washed (0.5% 
Tween-20 in PBS), the nuclei were dissociated with HCI/Triton (5 M HCl, 0.5% 
Triton) for 20min at room temperature, after which 10 volumes of 0.1M 
Na2B407 were added for neutralization. The cells were collected by centrifu- 
gation, washed again and stained with anti-BrdU-Alexa488 (1:100, Invitrogen 
A21303) for 30 min at room temperature in wash buffer containing 1% BSA. 
After two more wash steps, the cells were resuspended in FACS buffer (PBS 
containing 1% BSA) containing 10 1g ml! propidium iodide. Cells were mea- 
sured using a FACSCalibur flowcytometer (Becton Dickinson) and analysed 
using CellQuest software (Becton Dickinson). 


24. Lyons, S. K., Meuwissen, R., Krimpenfort, P. & Berns, A. The generation of a 
conditional reporter that enables bioluminescence imaging of Cre/loxP- 
dependent tumorigenesis in mice. Cancer Res. 63, 7042-7046 (2003). 
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The effects of molecular noise and size control on 
variability in the budding yeast cell cycle 


Stefano Di Talia’, Jan M. Skotheim’, James M. Bean’t, Eric D. Siggia* & Frederick R. Cross’ 


Molecular noise in gene expression can generate substantial vari- 
ability in protein concentration’. However, its effect on the pre- 
cision of a natural eukaryotic circuit such as the control of cell 
cycle remains unclear. We use single-cell imaging of fluorescently 
labelled budding yeast to measure times from division to budding 
(G1) and from budding to the next division. The variability in G1 
decreases with the square root of the ploidy through a 1N/2N/4N 
ploidy series, consistent with simple stochastic models for molecu- 
lar noise. Also, increasing the gene dosage of G1 cyclins decreases 
the variability in G1. A new single-cell reporter for cell protein 
content allows us to determine the contribution to temporal G1 
variability of deterministic size control (that is, smaller cells 
extending G1). Cell size control contributes significantly to G1 
variability in daughter cells but not in mother cells. However, even 
in daughters, size-independent noise is the largest quantitative 
contributor to G1 variability. Exit of the transcriptional repressor 
Whi5 from the nucleus partitions G1 into two temporally un- 
correlated and functionally distinct steps. The first step, which 
depends on the G1 cyclin gene CLN3, corresponds to noisy size 
control that extends G1 in small daughters, but is of negligible 
duration in mothers. The second step, whose variability decreases 
with increasing CLN2 gene dosage, is similar in mothers and 
daughters. This analysis decomposes the regulatory dynamics of 
the Start transition into two independent modules, a size sensing 
module and a timing module, each of which is predominantly 
controlled by a different G1 cyclin. 

In budding yeast (Saccharomyces cerevisiae), smaller cells delay the 
Start transition in late G1, which could imply that cell cycle initiation 
has a deterministic requirement for a critical size or translation 
rate’ >. However, in apparent contradiction to deterministic models, 
the timing of G1 in budding yeast shows substantial variability that is 
independent of cell size*°. Here, we analyse the conflicting roles of 
deterministic cell size control and molecular noise in gene express- 
ion’”'® in generating and controlling cell cycle variability. 

We first address the hypothesis that gene expression noise’ gen- 
erates significant cell cycle variability. Doubling ploidy doubles the 
number of all cellular components, but cell volume also doubles, so 
the average concentrations of cellular components remain con- 
stant''. If stochastic variation in the numbers of key molecules causes 
gene expression noise, then doubling their average number along 
with the ploidy should reduce this noise (expressed as standard devi- 
ation divided by the mean) by about ,2 (ref. 12). If cell cycle timing is 
controlled by noisy gene expression, then doubling ploidy should 
similarly reduce cell cycle variability. 

We measured times from cytokinesis to budding (G1) and from 
budding to cytokinesis in haploids, diploids or tetraploids (mothers 
and daughters), using time-lapse fluorescence microscopy of 
strains expressing Myol tagged with green fluorescent protein 


(Myo1-GFP). Myol forms a ring at the new bud neck"* (concomitant 
with initiation of DNA replication’), which disappears at cytokin- 
esis’* (Fig. 1). G1 variability is reduced in both mothers and daugh- 
ters by about the expected factor of 2 for each ploidy doubling 
(Fig. 1, Supplementary Table 8, Supplementary Fig. 7). Thus molecu- 
lar noise might be an important source of G1 variability. In marked 
contrast, variability in the time from budding to cytokinesis is nearly 
unaffected by ploidy (Supplementary Table 8). 

The magnitude and sensitivity to ploidy of G1 noise indicate that 
the noise might be due to small variable numbers of key regulatory 
molecules. As G1 cyclins control average G1 duration’*** and are 
expressed at a level of few messenger RNA transcripts per cell’, we 
integrated tandem arrays of the G1 cyclin genes CLN2 or CLN3 in 
haploids. Increasing copy numbers of G1 cyclin genes decreases G1 
variability (Fig. 1, Supplementary Table 10), which could help to 
explain the reduction in variability that is caused by the presence 
of multiple copies of the entire genome (Fig. 1). 

This analysis is based solely on timing; however, cell size has long 
been proposed as a deterministic regulator of the Start transition*”’. 
To provide a protein-based single cell marker of cell size, we placed 
the DsRed Red Fluorescent Protein under the control of the pro- 
moter of the constitutive, strongly expressed actin (ACT1) gene. 
Assuming that the DsRed transcript accumulates and is translated 
in parallel with bulk cellular mRNA, then total red fluorescence per 
cell will reflect total cell protein content. When we quantified total 
red fluorescence per cell as described”, we found exponential growth 
in single cells (Fig. 2a; Supplementary Information), as deduced 
previously from pulse-labelling of size-selected populations”. The 
single-cell growth rate « is moderately variable, but its average agrees 
well with the bulk culture growth rate (Fig. 2b, Supplementary Table 
12). Total red fluorescence scales linearly with ploidy (Supplemen- 
tary Table 11) and with geometric estimates of cell size. However, 
using our methods, DsRed fluorescence is a more reliable indicator of 
cell size than geometric volume estimation (see Supplementary 
Information). Total red fluorescence for a colony increases exponen- 
tially (Fig. 2c), so changes in the microenvironment do not interfere 
with these measurements. These results support the use of total red 
fluorescence from ACT1pr—DsRed as a single-cell marker for cell size. 

Size control at Start would require smaller cells to prolong G1 for 
growth, thereby linking birth size and G1 duration. Given exponen- 
tial growth, the size at budding, Myua; is related to the size at birth 
Mbpirt through the amount of time spent in G1 by the simple for- 
mula: Myung = Mbinne?“, where o is the growth rate for exponential 
growth and TG, represents the overall duration of G1. This expres- 
sion yields: «To; =In(Myua)—In(Myirtn). Plotting correlations 
between &T.; and In(Mpirtn) allows us to distinguish between two 
classical concepts for G1 control: timers and sizers**’’. IfG1 duration 
is under the control of a timer, then «T, will be independent of cell 


'The Rockefeller University, ?Center for Studies in Physics and Biology, The Rockefeller University, New York, New York 10021, USA. +Present address: Human Oncology and 
Pathogenesis Program, Memorial Sloan-Kettering Cancer Center, New York, New York 10021, USA. 
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Figure 1| Noise in G1 duration is reduced by increased ploidy or increased 
G1 cyclin gene dosage. a, Composite phase contrast, Myol—GFP and 
ACT 1pr—DsRed images for haploid cells. b, Illustration of measured 
intervals. cl, Frequency histograms (n = 87-202) of the duration of G1 for 
wild-type (WT) haploid (c, h), diploid (d, i) and tetraploid (e, j), haploid 
6XCLN3 (f, k), and haploid 6X CLN2 (g, I), daughters (c, d, e, f, g) and 
mothers (h, i, j, k, 1). Insets: mean and coefficient of variation (CV: s.d./mean, 
a standardized noise measure). 
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size at birth, and the slope of the linear fit of the plot of «Tq against 
In(Myirth) will be 0. By contrast, if G1 is controlled by a sizer, all cells 
will bud at the same size M,ug, independent of their size at birth, 
implying that the slope of the linear fit of xT, against In( Myirtn) will 
be —1 (ref. 27). 

For the following analysis, rigorous statistical testing of fits is 
described in the Supplementary Information. 

Scaled G1 duration in mother cells is essentially independent of cell 
size (slope ~ —0.1), showing that mother G1 is controlled by a timer 
(Fig. 2d, Supplementary Fig. 9). Daughters, by contrast, show 
stronger size control (slope ~-0.4). Binning the daughter data 
(Fig. 2e inset) suggested decomposition into two segments, one for 
small newborn daughters (<67% of the average budding size), in 
which an efficient sizer was deduced (slope ~ -0.7), and a second 
segment for larger-born daughters, which showed much less depend- 
ence on cell size (slope ~ —0.3; Fig. 2e). Statistical confidence in this 
decomposition was limited by the small number of very small daugh- 
ters obtained; therefore, we employed the genetic method described 
in ref. 17 to make unusually small wild-type daughter cells by tran- 
sient expression of conditional MET3-—CLN2 (see Supplementary 
Information). Inclusion of these data (Fig. 2f) provided strong stat- 
istical support for the two-slope model (linear fit: P< 0.05; two-slope 
fit: P> 0.7). 
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Figure 2 | The correlation between cell size and G1 duration shows that a 
noisy size control operates in daughters. a, Logarithm of total DsRed 
fluorescence (M) per cell in a single representative cell from birth to 
cytokinesis. Doubling time is In(2)/a, where « is the slope of the linear fit. 
b, Haploid cell doubling time distribution. c, Total DsRed fluorescence in an 
entire colony over time. d, e, Correlation between «Tg; (growth-rate- 
standardized time in G1) and In(M) for haploid mothers (d) and daughters 
(e) at birth (In(Mpirtn)). Insets, binned data. f, Data from e (solid blue dots), 
supplemented with data from unusually small wild-type haploid daughters 
(open green circles), generated using essentially the method of ref. 17. For 
statistical analysis and estimated slopes, see Supplementary Information. 
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Efficient size control ensures that all cells bud at the same size. As 
there is variability in cell size at birth, an efficient sizer would ensure 
that smaller cells spend longer in G1, generating cell-to-cell variabil- 
ity in G1 duration. Measuring individual growth rates and cell sizes 
allows G1 variability to be decomposed into variability due to size 
control and a size-independent residual that is attributable to 
molecular noise. Assuming that G1 duration for an individual cell 
is the sum of a deterministic function of cell size at birth, f(Mbirtn), 
and a stochastic variable, 7, then: “To: = f(Mbirtn)+ 1, where 
f(Mpirtn) is obtained empirically by binning data. For a measured 
distribution of sizes at birth, the variance of f(Mbirtn) yields the 
amount of Gl variability that is produced by size control. Size- 
independent variability is the average distance between a data point 
and the deterministic f(Mpirth)- 

This decomposition shows that size-independent (presumably 
molecular) noise is the leading source of variability in the duration 
of G1 in both mothers and daughters (Table 1). However, size control 
accounts for about 30-40% of overall G1 variability in daughters. 
Consistent with our previous timing analysis, size-independent noise 
decreases by about a factor of 2 for each ploidy doubling (Table 1; 
Supplementary Figs 7, 9, 10); has, ploidy-dependent noise reduction 
is robust to statistical removal of all detectable size and growth 
rate effects, indicating that molecular noise is likely to explain size- 
independent variability. The dosage of the genes for G1 cyclins also 
decreases size-independent variability; therefore, some of this vari- 
ability might be attributable to stochastic variation in the expression 
of G1 cyclins themselves, although other targets cannot be excluded. 

We can divide G1 into two steps on the basis of nuclear residence of 
Whi5**”*, which enters the nucleus late in mitosis and exits during 
G1” (Fig. 3a, b). Whi5 represses the transcription of genes that are 
activated by the SBF/MBF transcription factors*, including CLN1 
and CLN2. Cln3 initiates the inactivation and nuclear exit of 
Whi5***°. We call the period between cytokinesis and Whi5 exit 
T,, and the period between Whi5 exit and budding T. In mother 
cells, T, is only ~1 min (Table 2), so the ~15 min length of G1 in 
mothers is mainly determined by T>. By contrast, in daughter cells T, 
and T, are about 20 and 17 min, respectively. As T, and T> are 
uncorrelated (Fig. 3c), Whi5 exit marks the boundary between two 
independent steps. For daughters, plotting «T), the time from birth 
to Whi5 nuclear exit scaled with the growth rate «, against the log- 
arithm of the size at birth, yields a nearly identical relationship to that 
for overall G1 duration «T,, (Figs 2e and 3d, e), shifted down owing 
to growth during T;. The indicated two-slope model fits these data 


Table 1| Decomposition of G1 variability into a deterministic size control 
term and a residual attributable to molecular noise 


G1 noise, birth 


Noise due to size Size and growth 


to budding control (percentage of | rate-independent 
(coefficient of the total noise (percentage 
variation of variance of %Tg,) of the total variance 
aTe1) of «T.1) 

Daughters 
Wild-type haploids 0.55+0.06 0.31+0.04(32) 0.45 + 0.04 (68) 
Wild-type diploids 0.42+0.04 0.28+0.03 (45) 0.31 +£0.03 (55) 
Wild-type tetraploids 0.24+0.02 0.15+0.01 (39) 0.19 +0.02 (61) 
Haploid 6x CLN2 0.48+0.04 0.30+0.03 (39) 0.37+0.03 (61) 
Haploid 6xCLN3 0.44+0.04 0.25+0.02 (32) 0.36+0.03 (68) 
Haploid 6XCLN3 6XCLN2 0.37+0.03 0.18+0.02 (24) 0.32 + 0.03 (76) 
Mothers 
Wild-type haploids 0.50+0.05 0.20+0.02(16) 0.46 + 0.04 (84) 
Wild-type diploids 0.39+0.04 0.13+0.01(11) 0.37 40.04 (89) 
Wild-type tetraploids 0.26+0.02 0.09+0.01(12) 0.24+0.02 (88) 
Haploid 6X CLN2 0.33+0.03 0.13+0.01(16) 0.30+0.03 (84) 
Haploid 6xCLN3 0.48+0.05 0.16+0.02(11) 0.45+0.04 (89) 
Haploid 6XCLN3 6XCLN2 0.34+0.03 0.17+0.02 (25) 0.29 + 0.02 (75) 


G1 noise (first column): coefficient of variation of «Tg, (x, growth rate; Tg, G1 duration). G1 
noise is decomposed into size-dependent and size-independent components (second and third 
columns); in parentheses, the percentage of the variance of «Tg, accounted for in each column. 
Noise in «7 is the square root of the sum of the squares of the two independent noise 
contributions. 
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Figure 3 | The correlation between cell size and the regulation of Whi5 nuclear 
residence supports decomposition of Start into a size-control module and an 
independent timing module. a, Composite phase contrast, WhiS—GFP and 
ACT 1pr—DsRed images for haploid cells. Whi5—GFP is mostly observed in new- 
born daughter cells. b, Diagram of the measured intervals; Tg) (Fig. 1) is 
approximately T, + T>. c-h show only data for daughters. ¢c, T; and T, are 
uncorrelated (correlation coefficient —0.1) (data from ref. 24; WT, wild-type). 
d, xT, against In(Myirtn). ¢, Binned data from Fig. 3d (green points, black line) 
and Fig. 2e (blue points, red line). f, Binned data for xT, (green points, black line) 
and «Tg (blue points, red line) against In(Mpirtn) for cells grown in glycerol/ 
ethanol (g/e; see also Supplementary Fig. 3). g, xT, and «Tg against In(My itn) 
for 6X CLN3 cells. h, xT) against In(Mpirtn) for whi5 cells. i, Model decomposing 
Start into a size-control module and an independent timing module unaffected 
by cell size. For statistical analysis, see Supplementary Information. 
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Table 2 | Average durations of T, and T2 in different strains 


Wild type cln3 6x CLN3 6x CLN2 
T, indaughters 20+41(157) 30+4(47) 10+1(53) 16 + 1 (80) 
T; in mothers 0.9+0.3(170) 1341(55) 0.7+05(56) 1.1+0.4 (90) 
T2in daughters 1742(157) 14+2(47) 16+1(53) 11+ 1 (80) 


14.7+0.6 (170) 13+41(55) 13.6+0.8 (56) 12.0 + 0.6 (90) 


T,, period from cytokinesis to Whi5 nuclear exit; T2, period from Whi5 exit to bud emergence 
(see Fig. 3b). Mean = s.e.m. in min (number of observations in parentheses). 


T2 in mothers 


significantly better than a one-slope model, and the deduced slopes 
for the Whi5 data and for the total Gl data are similar (see 
Supplementary Information). Thus, G1 size control is restricted to 
T,, the period of Whi5 nuclear residence. T,, the part of G1 after 
Whi5 exit, is independent of cell size, and similar in mothers and 
daughters” (Table 2). 

These observations are robust to changes in nutrient conditions: 
growth of cells in glycerol/ethanol instead of glucose, resulting in 
slow growth and generation of very small newborn daughter cells, 
gave quantitatively similar results (Fig. 3f; see Supplementary Infor- 
mation). As for the small daughters produced by transient MET3— 
CLN2 expression (Fig. 2f), the combined data sets for glycerol/ 
ethanol and glucose strongly supported a two-slope model for total 
daughter G1 and Whi5 nuclear residence times (see Supplementary 
Information). Even in glycerol/ethanol, mother cells showed little or 
no size control (Supplementary Fig. 6). 

Increasing CLN3 gene dosage decreases T,, whereas increasing 
CLN2 gene dosage modestly decreases T> (Table 2, see Supplemen- 
tary Information). Deletion of CLN3 significantly increased T, in 
both mothers and daughters, so the shortness of T; in wild-type 
mothers is ClIn3-dependent (Table 2). Efficient size control, indicated 
by the steep slope for small wild-type daughters, is essentially elimi- 
nated by an increase in CLN3 gene dosage or deletion of WHI5 
(Fig. 3e, g, h). 

Changing the copy numbers of CLN3 as compared with CLN2 had 
differential effects on G1 variability in mothers and daughters (Fig. 1). 
The two-step model explains this, as an increase in CLN3 copy num- 
ber should only affect the first step, which is slow in daughters but 
very rapid in mothers. As, in mothers, G1 is temporally dominated by 
the second step, mother cell G1 variability is more sensitive to 
changes in CLN2 copy number (Fig. 1). Consistent with this idea 
and with independence of the two steps, combining 6x CLN3 
together with 6x CLN2 in one haploid genome resulted in low G1 
variability in both mothers and daughters (Table 1, Supplementary 
Fig. 8). 

Therefore, we decompose Gl into two independent steps 
separated by Whi5 nuclear exit: a size-sensing module and a size- 
independent timing module. The first step depends on both Cln3 and 
cell size, and the second step depends on Cln2, but not on cell size or 
Cln3 (Fig. 3i). Temporal variability in the first step is due to the 
natural variability in cell size at birth coupled with size control, as 
well as molecular noise, possibly due to variability in CLN3 express- 
ion. The duration of the second step is cell-size independent; its 
variability is affected by the expression of the G1 cyclin CLN2, one 
of the primary final effectors of Start'*'’. This analysis demonstrates 
that molecular noise has a role in generating variability in a cellular 
transition, and provides a precise quantitative framework for the 
analysis of size control. 


METHODS SUMMARY 

Strain and plasmid constructions. Standard methods were used throughout. All 
strains are W303-congenic. All integrated constructs were characterized by 
Southern blot analysis. 

Time-lapse microscopy. Cells were prepared for time-lapse microscopy as 
described**. We observed growth of microcolonies with fluorescence time-lapse 
microscopy at 30°C using a Leica DMIRE2 inverted microscope with a Ludl 
motorized XY stage. Images were acquired every 3 min for cells grown in glucose 
and every 6 min for cells grown in glycerol/ethanol with a Hamamatsu Orca-ER 
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camera. We used custom Visual Basic software integrated with ImagePro Plus to 
automate image acquisition and microscope control. 

Image analysis. Automated image segmentation and fluorescence quantifica- 
tion of yeast grown under time-lapse conditions, and semiautomated assign- 
ment of microcolony pedigrees, were performed as described”. Budding was 
scored visually by the appearance of Myol—-GFP at the incipient bud neck, and 
division by its disappearance, generally with single-frame accuracy. The nuclear 
residence of Whi5—GFP was scored by visual inspection of composite phase 
contrast—fluorescent movies. Cell size was measured as the total cell fluorescence 
from DsRed protein, expressed from the constitutively active ACT] promoter. 
Background was measured as the average fluorescence of unlabelled cells for each 
movie and subtracted from the measured pixel intensities. To combine data from 
different experiments, we normalized our measurements of red fluorescence to 
the average red fluorescence at budding computed for each colony separately. 
Similar results were obtained by normalizing the size of cells by using the average 
intensity computed for each colony separately. 

Data analysis. Time-lapse microscopy data were analysed with custom software 
written in MATLAB. Binning was performed by marking off nonoverlapping 
intervals on the horizontal axis, averaging the data in each interval and plotting 
in the centre of the interval. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 


Received 12 March; accepted 6 July 2007. 
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METHODS 

Strain and plasmid constructions. Standard methods were used throughout. All 
strains are of the W303 background. The plasmid pSD03 (pRS403—CLN2) was 
obtained by cloning the Smal—Sfol fragment containing CLN2 genomic DNA 
obtained from the Yep24—CLN2 2,1 plasmid (J. McKinney, unpublished data) at 
the Smal site in pRS403. The CLN2 genomic fragment started 1.4 kb upstream of 
the CLN2 open reading frame and ended about 8 kb downstream. A homologous 
recombination pop-out of the Ty1 (retrotransposon) downstream of CLN2 was 
found in the original Yep24-CLN2 21 plasmid. pSD03 was integrated at the HIS3 
locus by BglII digestion. Strain $D27-1-1A was confirmed by Southern blot 
analysis to have five duplicative integrations of pSD03. The plasmid pJBO6T 
(pRS404—-ACTIpr—DSRED) was obtained as follows. The ACTIpr (~500 bp 
upstream of the ACT1 open reading frame) was inserted into pTY24 (obtained 
from NCRR Yeast Resource Center, University of Washington) just upstream of 
DSRED coding sequence. The BamHI-BglII fragment containing ACTI1pr— 
DSRED was then inserted at the BamHI site in pRS404. Plasmid pSD02 
(pRS406—ACT1pr—DSRED) was obtained by cloning the Bgl fragment contain- 
ing ACT1pr—DSRED obtained from pJBO6T into the Bg/l fragment containing 
URA3 obtained from pRS406. pSD02 was integrated at the URA3 locus by Ncol 
digestion. Strain SD20-1A was confirmed by Southern blot analysis to have two 
duplicative integrations of pSD02. All the other strains that carry pSD02 were 
obtained by crosses with SD20-1A or with strains derived from it so they also 
have two duplicative integrations of pSD02. We observed that strains trans- 
formed with one copy of pJBO6T or two copies of pSD02 behave identically with 
the only difference that the average intensity of the second reporter is about twice 
as large as the average intensity of the first one, as expected by difference in copy 
number. A MYO1-GFP strain’ was backcrossed at least five times to W303 to 
obtain the strains used in this paper. 

Time-lapse microscopy. Cells were prepared for microscopy and time-lapse 
microscopy as described. We observed growth of microcolonies with fluor- 
escence time-lapse microscopy at 30 °C using a Leica DMIRE2 inverted micro- 
scope with a Ludl motorized XY stage. Images were acquired every 3 min for cells 
grown in glucose and every 6 min for cells grown in glycerol/ethanol with a 


nature 


Hamamatsu Orca-ER camera. We used custom Visual Basic software integrated 
with ImagePro Plus to automate image acquisition and microscope control. 
Image analysis. Automated image segmentation and fluorescence quantifica- 
tion of yeast grown under time-lapse conditions, and semiautomated assign- 
ment of microcolony pedigrees, were performed as described**. Budding and 
division were scored by visual inspection for the appearance and disappearance 
of the Myol-GFP signal, respectively. Detection of the appearance of the Myol- 
GFP signal was facilitated by thresholding the images and plotting the pixels 
above threshold on a linear grey scale. The nuclear residence of Whi5—GFP was 
scored by visual inspection of composite phase contrast-fluorescent movies and 
confirmed by the method described for Myol—GFP detection. Scoring of cell 
cycle events by monitoring the localization of Myol-GFP and Whi5-GFP is 
much faster than detection of newly synthesized, cell-cycle-regulated fluorescent 
proteins and allows a much higher temporal resolution. Cell size was measured 
as the total cell fluorescence from DsRed protein, expressed from the constitu- 
tively active ACT1 promoter. Background was measured as the average fluor- 
escence of unlabelled cells for each movie and subtracted from the measured 
pixel intensities. The measurement of cell size at given time points displayed an 
appreciable variability, probably due to experimental noise in the imaging and 
errors in the segmentation of cell bodies. The effect of this noise on size measure- 
ments was efficiently averaged out by extracting cell size at a given time point 
from the least-square fit to exponential growth. The average fluorescent intensity 
of identical strains imaged on different days and with different fluorescent lamps 
displayed variability much larger than natural variability among cells in the same 
colony. To combine data from different experiments we normalized our mea- 
surements of cell size to the average size at budding computed for each colony 
separately. Similar results were obtained by normalizing the size of cells by using 
the average intensity computed for each colony separately. 

Data analysis. We analysed time-lapse microscopy timing and fluorescent data 
with custom software written in MATLAB. Binning was performed as follows: 
multiple points were placed in bins and averaged. The averages and the values of 
the centre of the bin were then plotted. Similar results to that obtained by binning 
the data were obtained by smoothing the data with a moving average. 
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Antidepressant binding site in a bacterial homologue 
of neurotransmitter transporters 


Satinder K. Singh’, Atsuko Yamashita*t & Eric Gouaux!* 


Sodium-coupled transporters are ubiquitous pumps that harness 
pre-existing sodium gradients to catalyse the thermodynamically 
unfavourable uptake of essential nutrients, neurotransmitters and 
inorganic ions across the lipid bilayer’. Dysfunction of these integ- 
ral membrane proteins has been implicated in glucose/galactose 
malabsorption’, congenital hypothyroidism’, Bartter’s syndrome’, 
epilepsy’, depression’, autism’ and obsessive-compulsive disorder®. 
Sodium-coupled transporters are blocked by a number of thera- 
peutically important compounds, including diuretics’, anticon- 
vulsants’® and antidepressants'', many of which have also become 
indispensable tools in biochemical experiments designed to probe 
antagonist binding sites and to elucidate transport mechanisms. 
Steady-state kinetic data have revealed that both competitive’®’’ 
and noncompetitive'’*’ modes of inhibition exist. Antagonist dis- 
sociation experiments on the serotonin transporter (SERT) have 
also unveiled the existence of a low-affinity allosteric site that slows 
the dissociation of inhibitors from a separate high-affinity site’’. 
Despite these strides, atomic-level insights into inhibitor action 
have remained elusive. Here we screen a panel of molecules for their 
ability to inhibit LeuT, a prokaryotic homologue of mammalian 
neurotransmitter sodium symporters, and show that the tricyclic 
antidepressant (TCA) clomipramine noncompetitively inhibits 
substrate uptake. Cocrystal structures show that clomipramine, 
along with two other TCAs, binds in an extracellular-facing ves- 
tibule about 11 A above the substrate and two sodium ions, appar- 
ently stabilizing the extracellular gate in a closed conformation. 
Off-rate assays establish that clomipramine reduces the rate at 
which leucine dissociates from LeuT and reinforce our contention 
that this TCA inhibits LeuT by slowing substrate release. Our 
results represent a molecular view into noncompetitive inhibition 
of a sodium-coupled transporter and define principles for the 
rational design of new inhibitors. 

LeuT is a stable, sodium-coupled leucine transporter from the 
eubacterium Aquifex aeolicus and is the only member of the neuro- 
transmitter sodium symporter (NSS or SLC6) family of secondary 
transporters that has so far been amenable to structural analysis’’. 
Eukaryotic NSS counterparts include those that pump neurotrans- 
mitters such as serotonin, norepinephrine, dopamine, glycine and 
y-aminobutyric acid (GABA) from the synapse to neuronal and glial 
cytoplasms, shaping the magnitude and duration of synaptic signal- 
ling'’. The crystal structure of LeuT provided the first molecular 
glimpse into an NSS member”, but it did not yield any clues as to 
the atomic basis of inhibition. Motivated by this dearth of knowledge, 
together with the key role of NSS proteins and their antagonists in 
synaptic transmission, we sought to find inhibitors of LeuT and to 
characterize their mechanism of action by using flux, binding and 
crystallographic methods. 


We began by screening a wide variety of NSS inhibitors, including 
those that target the transporters of glycine (GlyT), GABA (GAT), 
dopamine (DAT), norepinephrine (NET), and serotonin (SERT), for 
their ability to inhibit the uptake of 1-[*H]Leu by purified LeuT 
reconstituted into lipid vesicles. The tricyclic antidepressant (TCA) 
clomipramine was the most potent inhibitor of uptake (Fig. 1a), 
whereas the structurally related compounds, imipramine and desi- 
pramine, were less effective (Fig. la, Supplementary Fig. 1a). In dose— 
response experiments (Fig. 1b), clomipramine was a modestly potent 
inhibitor of leucine transport, with a half-maximal inhibitory con- 
centration (IC;,) approximately eightfold lower than that of imipra- 
mine (250 versus 2,090 11M; Supplementary Table 1). To test whether 
the TCAs could displace bound leucine, we conducted radioligand 
binding experiments using detergent-solubilized LeuT, [°H]Leu, and 
potential cold competitors. Unlike amino acids such as leucine, ala- 
nine and tryptophan, none of the TCAs (at 1 mM) significantly dis- 
placed [PH]Leu from LeuT (Supplementary Fig. 1b), tentatively 
excluding competitive inhibition as the mechanism by which the 
TCAs inhibit LeuT. 

To define the mode of inhibition by clomipramine more precisely, 
we next performed a series of steady-state kinetic experiments. With 
(°H]Leu as the substrate, clomipramine unambiguously reduced the 
maximum velocity of uptake at infinite concentration of substrate 
(Vinax)» although there was no clear correlation with the Michaelis 
constant (K,,; Supplementary Fig. 1c). This might be because at 20 °C 
LeuT exhibits a high affinity for leucine (dissociation constant 
(Ka) = 20 nM) and turns it over slowly, with a catalytic rate constant 
(kat) of only 1.2 hr (S.K.S., C. Piscitelli, A-Y. & E.G., manuscript in 
preparation), thus obscuring the distinction between simple binding 
and transport in a standard flux assay (Supplementary Fig. 1d). 
However, L-alanine is also a substrate of LeuT (Supplementary Fig. 
1d), it binds with a Kg at least 15-fold greater than that of leucine, and 
it has koa, and Ky, values ~5-fold greater and ~3-—4-fold greater, 
respectively, than the corresponding values for leucine (S.K.S, C.P., 
A.Y. & E.G., manuscript in preparation). Importantly, clomipramine 
is amore potent inhibitor of PH]Ala transport with an ICs9 of ~5 uM 
(Supplementary Fig. le, Supplementary Table 1). With [*H]Ala as the 
substrate and clomipramine as the inhibitor, the Michaelis-Menten 
plot (Supplementary Fig. 1f) showed a decrease in Vinax with no 
change in K,,, and the Eadie—Hofstee graph (Fig. 1c) had parallel lines 
with increasing clomipramine concentrations, both unequivocal 
indicators of noncompetitive inhibition (Supplementary Table 1). 
Crystallographic analysis confirmed that the LeuT—alanine-sodium 
complex adopts the same occluded state as the leucine complex (PDB 
code 2A65)'’, except for minute alterations in the substrate binding 
pocket (S.K.S., C.P., A.Y. & E.G., manuscript in preparation), validat- 
ing the use of [*H]Ala in the kinetic experiments. 
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Where does clomipramine bind and what is the molecular basis for 
its inhibition of substrate transport? To address these questions, we 
solved cocrystal structures of the leucine- and sodium-bound state of 
LeuT in complex with clomipramine, desipramine or imipramine as 
well as the alanine- and sodium-bound state of LeuT in complex with 
clomipramine. All four crystal structures refined well (Ree = 21.0- 
21.9%) using data measured to 1.7-1.9 A resolution (Supplementary 
Table 2). Simulated-annealing F,—F, omit maps revealed the saddle- 
shaped electron density expected for the dibenzazepine ring, and 
for clomipramine there was a clear protrusion at the 3-position 
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Figure 1| Functional analysis of LeuT and effect of inhibitors and TCAs. 
a, Inhibition of -[*H]leucine transport by a panel of eukaryotic NSS 
substrates and inhibitors, all at 1 mM. NFPS, N[3-(4’-fluorophenyl)-3- 
(4phenylphenoxy) propyl] sarcosine; Coc. anal., cocaine analogue 3-«-[(4- 
chlorophenyl)phenylmethoxy]tropane. b, Dose dependence of inhibition by 
clomipramine (CMI; filled circles) and imipramine (IMI; open circles) of 
(?H]Leu transport. ¢, Steady-state kinetics of inhibition by clomipramine of 
L-[*H]alanine transport (Eadie—Hofstee plot). Open circles, filled circles, 
open squares and filled triangles represent data for 0, 1, 20 and 50 uM 
clomipramine, respectively. [S] is the substrate concentration and vel. is the 
velocity of substrate uptake. Error bars represent s.e.m. of triplicate 
measurements. 
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pinpointing the location of the chlorine atom (Fig. 2a), a feature 
absent in the density associated with both imipramine (Fig. 2b) 
and desipramine (data not shown). Clomipramine (Fig. 2c), along 
with the other TCAs (Supplementary Fig. 2), binds to LeuT in the 
extracellular-facing vestibule that probably forms part of the 


Figure 2 | TCAs bind in the putative permeation pathway of LeuT. F,—F. 
simulated annealing omit maps, both contoured at 3.0 o, of clomipramine 
(a) and imipramine (b), in which the TCA was omitted from the simulated 
annealing run and subsequent phase calculation. The chlorine atom of 
clomipramine is green. ¢, LeuT tilted ~15° from the membrane plane to 
illustrate the binding sites of clomipramine and leucine, both depicted in 
space-filling representation. Helices whose residues interact with 
clomipramine are coloured. d, Electrostatic properties of the LeuT 
extracellular-facing vestibule, with clomipramine (yellow) cradled in the 
negatively charged crevice. Electrostatic potential isocontours were set at 
+7kTe™ (blue) and -7 kT e™ (red). EL4 is tinted green. To make 
clomipramine more visible, LeuT has been tilted an additional ~5° towards 
the reader from the view in ¢ (~20° from the membrane plane). 
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permeation pathway, lying between the extracellular solution and the 
occluded substrate- and ion-binding pocket located halfway across 
the membrane bilayer. The TCAs are cradled by the carboxy-terminal 
half of transmembrane helix 1 (TM1), the amino-terminal regions of 
TM6 and TM10, the approximate midpoint of TM3, and the sharp 
turn of extracellular loop 4 (EL4), about 11 A above the substrate and 
two sodium ions. Although most of the residues lining the TCA- 
binding pocket are nonpolar (Fig. 3a), essentially complementing 
the hydrophobic character of the TCAs, an electrostatic calculation 
shows that the binding site has a pronounced negative potential that 
probably enhances the binding of positively charged inhibitors 
(Fig. 2d). Clomipramine is further stabilized by a salt bridge between 
its protonated’? N2 atom and the side-chain carboxylate (O61) of 
Asp 401 (2.9 A) as well asa polar interaction between its chlorine atom 
and the side-chain amide (Ne2) of Gln 34 (3.1 A; Fig. 3a). Neither 
imipramine nor desipramine possesses a chlorine atom and thus they 
cannot participate in a similar interaction with Gln 34, which might 
partly explain why they are less potent inhibitors. There is a second, 
weak TCA binding site on the lipid-exposed face of the “V’ formed 
between TM4 and TM5, near the intracellular surface (Supplementary 
Fig. 3), but because this site has a low occupancy and is situated far 
from the substrate-binding site and probable transport pathway, we 
believe it is not functionally relevant. 

The conformation of the LeuI-TCA complexes is similar to that 
determined in the original complex with two crucial exceptions. 


fs H,O0 


Figure 3 | Clomipramine-binding site. a, Stereo view, with clomipramine 
and leucine carbon atoms depicted in yellow, sodium ions in purple and two 
water molecules in red. Residues whose interactions with clomipramine are 
hydrophobic, polar (Q34) or ionic (D401) are coloured grey, orange and 
pink, respectively. Atoms depicted as spheres interact through either a 
hydrogen bond or a salt bridge. b, F,-F. simulated annealing omit map, 
contoured at 3.0 o, of D404 and R30, depicting the direct salt bridge between 
the guanidium of R30 and the carboxylate of D404, with displacement of two 
water molecules (compare to overlay in d). Distances (A) are shown along 
dashed lines. c, F,-F. simulated annealing omit map, contoured at 3.0 o, of 
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The first is localized to the guanidium group of Arg 30, an amino 
acid which, along with Asp 404, forms a highly conserved charged 
pair that comprises part of the proposed extracellular gate. In the 
LeuT-leucine-sodium complex, this arginine and aspartate interact 
indirectly via two water molecules. In the TCA complexes, the gua- 
nidium group forms a direct salt bridge with Asp 404, displacing the 
intervening water molecules (Fig. 3a, b, d). The TCAs now comple- 
tely shield the salt bridge between Asp 404 and Arg 30 from aqueous 
solution, thereby lowering the local dielectric constant and presum- 
ably increasing the strength of the ion pair. The new position of the 
Arg 30 guanidium group is further buttressed by a cation-1 inter- 
action with Phe 253 (Fig. 3e), a residue that directly contacts the 
bound substrate. Significantly, the flipped position of the guanidium 
group does not prevent Arg 30 from engaging in a hydrogen-bonding 
network that ultimately connects it to Nal and the carboxylate of the 
substrate, thereby physically linking the TCA- and substrate-binding 
sites. Intervening atoms include one of the displaced water molecules, 
Oel and Ne2 of Gln250, Oe1 and Os2 of Glu 290, and Oy1 of 
Thr 254, a series of interactions that might thermodynamically cou- 
ple the binding of substrate and clomipramine, as supported by our 
observation that clomipramine exhibits different inhibitory poten- 
cies for [*H]Ala and [*H]Leu transport. 

The second change is localized to the sharp turn at Ala 319 in EL4 
(Fig. 3a, c, d), a region that exhibits conformational mobility during 
the transport cycle in GAT”, DAT”, GlyT1b” and SERT**. In the 


e \EL4 


H,O (CMI) 
©H,0 (CMI) 


the tip of EL4 (residues A317—N321), illustrating the movement ‘up’ of A319 
in the LeuT-clomipramine crystal structure (compare to overlay in 

d). d, Same view as in a with residues from the original LeuT structure (PDB 
ID 2A65) overlaid (in green with two water molecules between D404 and R30 
in cyan) onto those from the LeuT—clomipramine crystal structure (in pink 
with the two displaced waters in red and labelled as HyO (CMI)). e, CPK 
rendering of clomipramine, R30 and F253 illustrating how the positively 
charged guanidium group is sandwiched between the aromatic rings of 
clomipramine and F253. 
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leucine-sodium complex, the methyl group of Ala319 points 
towards the cavity occupied by the TCAs. In the clomipramine com- 
plexes, however, Ala 319 is flipped towards the extracellular side of 
the transporter, to accommodate binding of the inhibitor (Fig. 3c, d). 
In the imipramine and desipramine complexes, Ala 319 points down- 
wards, as in the leucine-sodium and alanine-sodium complexes, but 
the residues in the sharp turn (Ala 317—Ala 319) are shifted ‘up’ by as 
much as 1.4 A (Supplementary Fig. 4a, b). This shift provides a sec- 
ond mechanism by which movement of EL4 participates in TCA 
binding in LeuT. 

The location of the TCA-binding site in the extracellular-facing 
cavity of LeuT and on top of residues that have been implicated as 
part of the extracellular gate suggests a simple mechanism by which 
these molecules might inhibit substrate transport. We postulate that 
the occluded state of LeuT, as visualized in the leucine and sodium or 
the alanine and sodium complexes, is an essential intermediate in the 
translocation cycle and that the TCAs stabilize this occluded state, 
thereby trapping the substrate on the transporter and impeding its 
release to either the intracellular milieu or back to extracellular solu- 
tion (Fig. 4a). If this hypothesis is correct, then substrate should dis- 
sociate from LeuT more slowly in the presence of clomipramine. We 
used [*H]Leu for these experiments because PPH]Ala dissociates too 
quickly to accurately measure its off-rate (data not shown). A saturat- 
ing concentration of clomipramine does markedly decrease the off- 
rate of [*H]Leu by as much as 700-fold (Fig. 4b, c and Supplementary 
Table 1). Moreover, fitting the measurements in the absence of clo- 
mipramine to a double as opposed to a single exponential yields a 
statistically superior fit (P= 0.0003), perhaps because substrate can 
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unbind by way of either the extracellular or intracellular pathways. In 
contrast, the off-rate data in the presence of clomipramine are most 
adequately fit by a single exponential function (P = 0.6162). 

How, in molecular detail, does clomipramine slow the release of 
bound leucine? An explanation can be presented in terms of a vari- 
ation of the alternating access mechanism”, in which the transporter 
assumes at least three conformational states: open-to-outside, 
occluded, and open-to-inside. Access to either side of the membrane 
bilayer is controlled by extracellular and intracellular gates. In the 
occluded LeuT structure, in which both gates are closed, access to the 
substrate-binding site from the extracellular side is blocked by only a 
few residues, whereas access from the cytoplasmic side is obstructed 
by ~25 A of ordered protein structure. 

Clomipramine and related compounds probably retard the unbind- 
ing of substrate through the extracellular pathway by directly stabil- 
izing the salt bridge between Arg 30 and Asp 404, an ionic ‘latch’ that in 
turn stabilizes key aromatic residues, Tyr 108 and Phe 253, immedi- 
ately above the bound substrate (Fig. 4d). Clomipramine might also 
slow the rate of leucine release through the intracellular pathway by 
allosterically precluding movement of TMla and TM6b, perhaps 
together with other intracellular proximal regions such as TM3 and 
TMB, portions of the transporter that are likely to rearrange on sub- 
strate release (Fig. 4d). TM5 and TM10 might also participate in this 
transition to the cytoplasmic-facing form, as has been suggested from 
accessibility studies on SERT**”®. We speculate that clomipramine 
might prevent the extracellular-facing cavity from compacting during 
interconversion to the intracellular-facing form. Such a condensation 
in the extracellular-facing cavity has been postulated to occur in 
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Figure 4 | Clomipramine traps substrate on LeuT. a, Hypothetical graph 
depicting the free energy changes occurring along the transport cycle in the 
absence (solid lines) and presence (dotted lines) of clomipramine. 

b, Dissociation kinetics of [7H]Leu from LeuT on a 15-fold dilution into 
buffer containing 10 mM cold t-leucine and either 0 (closed circles; 

kore: = 85.9 + 18.1h! (62.0 + 4.9%), kop = 12.1 + 4.2 hh! (33.5 + 4.8%)) 
or 3mM (open circles; kg = 0.12 + 0.02h™') clomipramine. 


c, Magnification of the 0 mM CMP dissociation reaction, with the single and 
double exponential fits depicted in grey dotted and black solid lines, 
respectively. All data to 60 min were included in the fitting, but only the first 
10-min segment is displayed. Error bars represent s.e.m. of duplicate 
measurements. d, Schematic illustrating a mechanism for noncompetitive 
inhibition of LeuT by the TCAs. 
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SERT***”’, where the extracellular inhibitor cocaine is thought to 
prevent such a compaction from occurring to preclude subsequent 
opening of the cytoplasmic permeation pathway”. 

What relationship do our studies on LeuT have to the mechanism 
by which the TCAs inhibit NET and SERT, the targets of these drugs 
in humans? LeuT is clearly related in amino acid sequence to these 
biogenic amine transporters and, therefore, the three proteins must 
share a common architecture that probably includes a large extra- 
cellular-facing vestibule—a cone-shaped cavity of both hydrophobic 
and hydrophilic character that can serve as a binding site for mole- 
cules as diverse as TCAs and snail peptide toxins**. However, the 
sequences of SERT and NET still diverge substantially from that of 
LeuT and, thus, the details of inhibitor binding are likely to differ. 
Recent data on SERT” and NET” indicate that the TCAs probably 
occupy a primary site that is deeper in the protein core, close to or 
overlapping with the occluded substrate-binding pocket identified in 
the LeuT structures. Nevertheless, by demonstrating that molecules 
bound within the extracellular vestibule can exert noncompetitive 
inhibition of transport via stabilization of the occluded state, we have 
pinpointed an additional region that might be exploited for the 
development of new inhibitors with therapeutic potential. 


METHODS 

Preparation of LeuT proteoliposomes. LeuT was expressed and purified as 
described’’, and liposomes were prepared with a 3:1 (w:w) ratio of Escherichia 
coli total lipid extract and egg phosphatidylcholine in buffer I (20 mM HEPES- 
Tris (pH7.0), 100mM potassium gluconate). They were treated with Triton 
X-100 at a 2.5:1 (w:w) ratio of lipid to detergent before reconstituting with 
LeuT at protein-to-lipid ratio of 1:110 (w:w) and removing excess detergent 
with BioBeads. LeuT proteoliposomes were then loaded with buffer I and stored 
at —80 °C until needed. The efficiency of protein incorporation into lipid vesicles 
approached 100%, as assessed by a modified amido black assay”*. This value was 
then used in all functional experiments to estimate what quantity of LeuT to use 
in a specific assay and to convert specific transport activity expressed as pmol 
[°H]Leu per min to specific activity expressed as pmol [*H]Leu per min per mg 
protein. Proteolysis experiments established that ~60% of the LeuT is oriented 
with the N and C termini inside the liposome. 

Details of the inhibition screens and time courses as well as the dose response, 

steady-state inhibition kinetics, and dissociation kinetics experiments are pro- 
vided in Supplementary Information. 
Crystallography. LeuT was expressed and purified as described’? and then 
divided into three 30-40-l aliquots. TCAs were added to a final concentration 
of 10 mM, incubated on ice for 15 min and then subjected to crystallization trials 
by hanging drop vapour diffusion. For the LeuT—alanine-clomipramine com- 
plex, protein was purified as outlined for the LeuT-leucine-TCA complexes 
except for the presence of 100 mM alanine in all buffers. Crystals grew within 
7-10 days at 18 °C in the presence of 17-22% PEG 550 MME, 100 mM HEPES- 
NaOH (pH7-7.5) and 200mM NaCl. They were cryoprotected by gradually 
increasing the starting PEG 550 MME concentration to 35%. 

Details of data collection and structure determination are provided in 
Supplementary Information. 
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Tissue Issues 


Millions of tissue samples have been collected and archived, but researchers wanting to explore them at 
the molecular level have found it tough going. Nathan Blow investigates the issues. 


According to experts, there are more than a bil- 
lion tissue samples archived in hospitals and 
tissue banks around the world, most of them 
formalin-fixed and paraffin-embedded (FFPE). 
Today, these samples present both an incredible 
opportunity and a huge challenge to research- 
ers. FFPE tissue samples have been extensively 
annotated and well preserved, allowing detailed 
study of the progression of diseases such as 
cancer. But due to the method of preservation, 
obtaining biomolecules from these samples is 
proving difficult, to say the least. 

FFPE was first described more than 100 years 
ago, and most hospitals still use this method 
today. But there has never been a set of stand- 
ardized guidelines for processing FFPE tissue 
samples taken from patients to preserve tissue 
histology, let alone biomolecules. And although 
it works well for histology, the lack of standard- 
ized guidelines seems to have hampered the 
use of FFPE samples in molecular analyses. 
This may soon change, as pathologists are 
working towards standardizing FFPE sample 
preparation, and companies and researchers 
are developing the technology needed to iso- 
late biomolecules and tap into the vast treasure 


chest of archived samples. 
Although FFPE tissue prep- 
aration is simple in theory, 
many problems associated 
with downstream molecular 
applications — such as PCR 
or microarray analysis — can 
arise. “This is all about the fact 
that there has been no attention 
paid to uniformity of prepara- 
tion,” says David Rimm, a 
pathologist at Yale University. 
Between hospitals the time to 


tissue fixation and even the _ tissue analysis. 


David Rimm: time is key in 


enzymes degrading and modi- 
fying both. 

It seems obvious that rapid 
fixation is the answer, but no 
simple solution is in sight. 
Rimm says that for a researcher 
interested in only DNA or RNA, 
rapid fixation using quick freez- 
ing methods, instead of chemi- 
cal fixation, is probably best. 
But if you are interested in pro- 
teins, freezing is problematic as 
the subsequent thawing process 
tends to break up and denature 


3 


method of fixation can vary 
dramatically. 

“I would say that the biggest issue is time 
from ligation of circulation to fixation,” says 
Rimm. During this period of ischaemia, 
molecular changes occur that cause problems 
in obtaining biomolecules. “Phosphorylation 
is very sensitive to ischaemic times. There 
seems to be promiscuous phosphotases in the 
cell that knock phosphates off tyrosines dur- 
ing this period,” says Rimm. DNA and RNA 
can also suffer damage before fixation, with 


proteins and, like ischaemia, 
can lead to modifications. 

Then there are pathologists who think that 
standardizing the time to fixation will prove to 
bea difficult task. “Standardization would be 
great, but I don't know how realistic it is across 
institutions or even within institutions,” says 
Christine Iacobuzio-Donahue, a pathologist 
at Johns Hopkins University School of Medi- 
cine in Baltimore, Maryland. She believes that 
the best place to tackle the problems caused by 
ischaemia is actually downstream of fixation, 


FROZEN IN TIME 


Analysing proteins from tissue 
samples that have been formalin- 
fixed and paraffin-embedded 
(FFPE) can provide critical 
information about how cells 
function before fixation. “When 
you formalin-fix the protein, you fix 
it in time; it is not going anywhere,” 
says Peter Tunon, vice-president of 
sales and marketing at Expression 
Pathology in Gaithersburg, 
Maryland. Researchers are taking 
advantage of this fact to explore 
the protein world in FFPE samples. 


Caught on camera: a section of human muscle 


for broad-based 
screening of 

proteins in FFPE 
samples. Other 
companies also 


all reagents phosphorylation and acetylation, 
are completely can also be observed when 
compatible examining proteins isolated 

with mass- from FFPE samples. PTMs can 
spectrometry be critical to the role of a protein 
instrumentation, in the cell, changing the function 
says Tunon, or localization. “We do see post- 
making it a good translational modifications that 
starting point are preserved on the peptides, and 


they seem to be present in ratios 
that are similar to those we get 
when using fresh frozen tissues,” 
Tunon says of work done by 
Expression Pathology. 

Even much older archival 


Expression Pathology was 
founded in 2001 by researchers 
from the US National Cancer 
Institute and the company Life 
Technologies (now Invitrogen) 
who had experience in studying 
gene expression in tissue and 
histology. “The company was 
founded on the fact that examining 
protein expression is crucial to 
understanding what is happening 
in cells," says Tunon. To this end, 
Expression Pathology has worked 


after dual histochemistry. 


think that protein 


on ways of extracting and isolating 
proteins from FFPE tissue samples 
for analysis by mass spectrometry 
or reverse-phase dot blot 

arrays. The company developed 

a technology that integrates 
extraction of total proteins 

from FFPE samples with tryptic 
digestion so that finished samples 
are ready for mass spectrometry. 
To simplify the system further, 


isolation from 

FFPE tissue samples will provide 
valuable information. QIAGEN 
and EMD Chemicals, both based 
in San Diego, California, now offer 
systems that chemically reverse 
formalin crosslinking to isolate full- 
length proteins for applications 
such as western blotting and 
protein arrays. 

Surprisingly, post-translational 
modifications (PTMs) such as 
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samples of FFPE tissues do not 
seem to pose a problem for protein 
extraction. “We have worked 

with samples that are more 

than 15 years old and found no 
difference in the profiles compared 
to samples that were just a few 
weeks old,” says Tunon. He is quick 
to add that he believes even older 
samples could be examined, but 
Expression Pathology has yet to 
test this idea. N.B. 
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MOLECULAR DEVICES 


after the biomolecules have been isolated. 
“Once you have the samples and have extracted 
the biomolecules you are interested in, then 
you can perform quality control to determine 
if all your samples are similar,’ she says. 

Even if the optimal time to fixation were 
found and standardized, this is only part of 
the problem; researchers still need to identify 
the best fixative to preserve both histology and 
biomolecules. Formalin, the most widely used 
fixative, provides good preservation of tissue 
histology but can cause problems for research- 
ers interested in downstream molecular appli- 
cations. “Formalin tends to crosslink the tissue 
to such an extent that it is hard to get fragments 
of DNA that are longer than, maybe, a hun- 
dred or two hundred base pairs,’ says Rimm. 
Formalin fixation causes the crosslinking of 
DNA as well as RNA and proteins, although 
proteins seem to fair better following formalin 
treatment (see ‘Frozen in time’). 


Preservation society 

Although tissue preservation has its problems, 
the good news is that these are now being dealt 
with. Governmental agencies, such as the US 
Office of Biorepositories and Biospecimen 
Research at the National Cancer Institute 
(NCI) in Bethesda, Maryland, are starting to 
tackle the difficult issue of biospecimen stand- 
ardization. In June, the office released a guide 
to the NCI’s best practices for biospecimen 
resources, detailing technical guidelines for 
NCI-supported biospecimen collection and 
storage. And researchers and companies are 


The DASL assay can be processed on two different platforms: the Array Matrix (left) or the BeadChip. 


now creating methods to use the degraded and 
modified biomolecules obtained from FFPE 
samples for molecular analysis. 

Isolation of RNA or DNA from FFPE tissue 
samples can be accomplished using a number 
of methods or kits. The main problem is that, 
almost without fail, the RNA or DNA isolated 
is degraded and chemically modified. But 
because researchers want to tap into the vast 
archives of FFPE tissues for global expression 
analysis and biomarker discovery, this is spur- 
ring companies to address the issue. 

Microarray analysis has proven a valuable 
tool for understanding global gene-expression 
patterns. But using the degraded messenger 
RNA obtained from FFPE tissues for such anal- 
ysis is problematic. “The results on standard 
microarrays are currently unsatisfactory,’ says 
Shawn Baker, scientific product manager for 
gene expression at Illumina in San Diego, Cali- 
fornia. For this reason, Illumina offers a gene- 
expression application that can study RNA 


extracted from FFPE tissue samples. Called 
DASL, short for cCDNA-mediated annealing, 
selection, extension and ligation assay, this 
system amplifies the mRNAs from FFPE- 
extracted samples, but unlike other amplifi- 
cation systems it is not 3’ biased, says Baker. 
“We use a combination of random and oligo dT 
primers to generate the complementary DNA, 
which means that even with degraded RNA it 
still amplifies quite well and produces good, 
consistent profiles.” Following the first amplifi- 
cation, the DASL system uses two gene-specific 
probes to amplify the cDNA. The resulting 
cDNA can be hybridized to a DASL-specific 
array. “The DASL assay is multiplexed up to 
1,536 genes. So you get a tremendous boost in 
the overall throughput,” says Baker. 

Although unable to survey as many genes 
as standard microarrays, Baker says that using 
the DASL system, researchers have been able 
to profile FFPE tissue samples that are up to 
30 years old — demonstrating the potential to 
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THE CUTTING EDGE 


Researchers have the opportunity 


a slide to a collection vessel. 


to play surgeon — slicing 
and dissecting out specific 
sections of tissue or even cell 
populations — with laser-capture 
microdissection (LCM). This 
level of analysis might seem to be 
difficult when applied to formalin- 
fixed and paraffin-embedded 
(FFPE) tissue samples, but many 
companies are now offering easy 
and quick LCM solutions. 

Leica Microsystems of 
Wetzlar, Germany, offers the 
LMD6000 LCM system, which 
uses an upright microscope for 
dissection and capture. Christoph 
Horlemann, the company’s 
product manager for the 
LMD6000, says this is possible 
because the transport mechanism 
for capture is based on gravity, 
unlike other LCM systems on 
the market. 

‘Transport mechanism’ refers 
to the method for delivering a 
dissected tissue sample from 
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Other approaches to this process 
include that of PALM Microlaser 
Technologies in Bernried, 
Germany, which uses a ‘pressure 
catapult’ to send sections into 
tubes from the LCM instrument, 
and the CapSure system from 
Molecular Devices in Sunnyvale, 
California, which uses the laser to 
extend a polymer onto the tissue 
sample for capture. 

Leica Microsystems’ transport 
mechanism uses slides with a foil 
covering. The tissue is attached to 
the foil covering but not the slide, 
so researchers cut the tissue and 
the foil together and the sample 
simply falls by gravity into the 
collection tube below, explains 
Horlemann. But even this foil 
coating might not be needed in 
the future. 

Both Leica and PALM are 
working with Expression 
Pathology of Gaithersburg, 
Maryland, on what may be 


The Arcturus XT from 
Molecular Devices can 
use either ultraviolet or 
infrared laser light for 
tissue dissection. 


the next generation of LCM 
slides. Called Director, these 
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glass slides are based on laser- 
induced forward transfer (LIFT), 
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examine the numerous collections of archived 
FFPE samples. Illumina is now working to 
increase the number of genes analysed for each 
DASL assay. 

NuGEN Technologies in San Carlos, Cali- 
fornia, is another company developing meth- 
ods to use RNA extracted from FFPE tissue 
samples for gene-expression analysis. NuGEN 
specializes in working with very small amounts 
of RNA or difficult-to-use RNA, such as RNA 
extracted from whole blood or very degraded 
sources. This is commonly seen with clinical 
samples, but is most significant in the case of 
FFPE, says Gianfranco de Feo, senior director 
of customer solutions at NuGEN. Although 
NuGEN did not start offlooking at RNA from 
FFPE tissue samples, it was the next logical step. 
“We have had a product on the market for over 
a year now that allows users to work with very 
degraded RNA in very limited amounts, down 
to 500 picograms. We built on that technology 
to create kits for the much more degraded RNA 
that comes from FFPE samples,’ says de Feo. 

At the core of NuGEN’s technology is its 
amplification and labelling system, which has 
been optimized to work with Affymetrix 3’ 
microarrays. The system relies on a combination 
ofrandom hexamers, similar to that of Illumina, 
augmented with oligo (dT) primers to convert 
mRNA into cDNA in a linear amplification 
process. The inclusion of oligo (dT) primers was 
essential because the Affymetrix arrays probe 
the 3’ ends of transcripts. But NuGEN hopes 
to have labelling and hybridization protocols 
and products for other microarray platforms 
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available before the end of the year. 

To determine how well these degraded RNA 
samples from FFPE tissue will work on micro- 
arrays, NuGEN developed a tool using quanti- 
tative real-time PCR (qPCR) assay. It turns out 
that the results from this assay correlate very 
well with the overall results of microarray analy- 
sis, says de Feo, allowing researchers to decide 
whether the data that could be obtained from 
the array will be of sufficient quality to continue. 
And this is critical information, as in some cases 
less than 50% of the transcripts on the array may 
hybridize with the amplified RNA. 


Think small 
Asuragen in Austin, Texas, is a new company 
on the commercial block, working to under- 
stand and characterize the biological role of 
small RNAs. Although founded only a year 
ago, Asuragen’s RNA roots go much deeper. 
Asuragen is a spin-off of Ambion, a company 
that worked in the field of molecular biology 
with a focus on RNA for nearly 17 years. “At 
Ambion, we developed the first kits and tech- 
nologies for characterizing small RNAs,” says 
Gary Latham, associate director of technology 
development at Asuragen. “And when micro- 
RNAs emerged as a new class of regulatory 
RNAs in humans, we were sitting in an excel- 
lent position to explore this area of ‘biological 
dark matter.” In March 2006, Ambion was 
sold to Applied Biosystems for US$273 mil- 
lion and a portion of those proceeds were used 
to fund Asuragen. 

Asuragen has concentrated its efforts on the 


Leica Microsystems offers the LMD6000 for 
laser capture microdissection applications. 


diagnostic and therapeutic opportunities of 
microRNAs (miRNAs). “As they are smaller, 
miRNAs tend to survive the more tortuous 
conditions of FFPE tissue processing better 
than mRNAs,’ says Latham, which makes 


anon-contact microdissection 
technique that uses a thin energy- 
transfer coating that replaces 
plastic films or adhesives. The 
technology was co-developed 
by researchers at the US 
Naval Research Laboratory in 
Washington DC and scientists 
from Expression Pathology. 

Laser energy is transferred to 
the transfer layer and the layer 
is vaporized. The laser energy is 
then converted into kinetic energy, 
and the selected feature is shot 
instantly into a collection tube. 

LIFT works equally well whether 
the tissue is collected up (PALM) 
or down (Leica) as the energy is 
sufficient to propel the tissue into 
the collection tube either way. 
As the transfer layer completely 
absorbs the laser energy, the 
biomolecules in the sample are 
not affected. The use of glass with 
this coating has other advantages 
as well. 

"Glass slides are quite useful as 


you can also perform fluorescence 
and contrast applications 
without any interfering foil,” says 
Horlemann. 

Molecular Devices, a division 
of MDS Analytical Technologies, 
acquired Arcturus in 2006 
and is now providing both the 
Veritas and the Arcturus XT LCM 
systems. Whereas most LCM 
systems use an ultraviolet laser 
for cutting samples, the systems 
offered by Molecular Devices 
can have either ultraviolet or the 
standard infrared laser options. 

“The infrared laser is ideal for 
small areas where a user is looking 
to pick up only a cell or two,” says 
Steven Blakely, product manager 
for the Arcturus LCM systems. 
“Infrared allows for a gentle 
collection of cells.” 

The use of infrared laser light 
is critical to the CapSure system 
because the dye found in the 
CapSure polymer is activated and 
becomes adherent with infrared 


light. Blakely says the CapSure 
system is particularly useful for 
FFPE samples that are already 
mounted on glass slides because 
the transport mechanism involves 
only the polymer adhering to the 
tissue and so can work with any 
glass slide. 

Cutting tissue samples with 
ultraviolet light offers advantages 
too, such as dissecting thicker 
samples. Leica has worked 
to optimize objectives for its 
LCM systems with high-energy 
ultraviolet-light transmission, 
allowing more power to come 
in direct contact with the tissue 
sample. Horlemann says this 
provides faster cutting of the 
thicker tissue samples while using 
less power. 

The future of LCM might just lie 
in automation and increases in 
throughput. “Now, researchers 
want to do faster, automated 
microdissection for proteomics,” 
says Horlemann. But before 
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anything else, some hardware 
and software issues needed to 
be resolved. 

Any automated software 
package has to control all 
microdissection steps including 
focusing the microscope, 
recognizing cells of interest, 
focusing the laser and defining 
the area to cut — definitely 
not a simple task, but one that 
companies have worked on and 
made tremendous strides in 
recent years. Horlemann says 
that new advances are making 
automation easier every day. 

He points to the Director slides, 
which can allow for contrasting 
methods and fluorescence, 
making it easier when attempting 
to automatically define cells of 
interest as astep in the right 
direction. 

Although simpler now, further 
advances will be required before 
LCM becomes as easy as pushing 
a button and walking away. _—N.B. 
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Direct action: Expression Pathology’s Director slide (left) in action, and tissue samples before (middle) 
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and after laser capture microdissection with the system. 
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degradation less of an issue. This means that 
downstream applications, such as microarrays, 
are better suited to miRNAs isolated from 
FFPE samples than mRNAs, says Latham. “We 
have found that there is tremendous diagnos- 
tic potential for microRNAs as biomarkers of 
disease states,” he adds. This shows how com- 
panies are moving from traditional RNA analy- 
sis to non-traditional methods that use FFPE 
tissue samples. 

Several other companies also offer technolo- 
gies to explore RNA isolated from FFPE tissue 
samples. Panomics in Fremont, California, 
has developed a direct hybridization method 
using branched DNA technology. The system, 
called QuantiGene FFPE, is unique because 
it does not involve the linear amplification of 
RNA, as do many other systems. Instead, the 
use of branched DNA technology permits the 
direct measurement of RNA from the sample 
source. The Paradise System, developed by 
Arcturus Biosciences and now supplied by 
Molecular Devices of Sunnyvale, California, 
has been optimized for RNA extraction from 
FFPE samples, followed by a linear amplifica- 
tion step prior to use in qPCR or microarray 
applications. 

Even though much headway has been made 
in the molecular analysis of FFPE tissue sam- 
ples over the past few years, high-through- 
put solutions to examine the billions of 
archived FFPE tissue samples are 
still needed. But advances in tech- 
nologies, including tissue micro- 
arrays (TMAs) and laser-capture 


# 


Tissue microarrays containing single spots of tissue (inset) 


provide a high-throughput histology solution. 
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microdissection (see “The cutting edge’), are 
signalling that high-throughput analysis might 
be around the corner. 

In 1998, Juha Kononen and his colleagues 
described a tissue sampling method that pro- 
duced regular-sized spots that could be densely 
packed on a microarray slide (J. Kononen et 
al. Nature Med. 4, 844-847; 1998). Using this 
methodology, archival FFPE tissues can be 
sampled onto TMAs, allowing researchers to 
examine numerous samples by techniques 
such as fluorescence in situ hybridization and 
immunohistochemistry on a single microscope 
slide. Nucleic acids and proteins can even be 
extracted from archival FFPE tissue TMAs. 
Best of all, the technology for constructing 
TMAs is readily available from commercial 
suppliers. 


Spot the tissue 

Beecher Instruments of Sun Prairie, Wiscon- 
sin, produces both manual and automated 
tissue microarrayers. With the Manual Tissue 
Arrayer II, a block can be directly attached to 
a microtome for sectioning of arrays, whereas 
the automated arrayer, ATA-27, accepts nearly 
all tissue cassettes and can be adapted for either 
large arrays or variously shaped archival sam- 
ples. The ATA-27 can also accommodate a full 
range of tissue spot sizes from 

0.6 to 3 millimetres. 
One concern 
regarding TMAs 
has been the rep- 
resentation of 
a whole tissue 
section by a 
single spot. “As 
the through- 
Kin res put is so much 
AM Riee higher, I think 
es? that trumps the fact 
that it is not a whole 
section,’ says Rimm. But 
he cautions that care must be 
taken when constructing tis- 
sue microarrays. The optimal 
region to be sampled for the 
microarray is usually identi- 
fied by a pathologist, but only 
a section or core of this region 
can be placed into the array. 
Problems can arise because the 
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region that cores are taken from can vary. A core 
taken from the leading edge of a tumour might 
provide different results from a core acquired 
from the middle. So design of the slide is cru- 
cial; spots should be separated and standards 
(spots of known tissues) used. 

In the past, pathologists have evaluated and 
analysed TMAs following immunohistochem- 
istry or other histological analyses. But that is 
now changing, and some companies are offer- 
ing software specifically for automated TMA 
analysis. 


Image bank 

Bacus Laboratories in Lombard, Illinois, and 
recently acquired by Olympus, is one such 
company. It has focused its efforts on virtual 
microscopy — the digital imaging of micro- 
scope slides — which involves scanning an 
entire slide at very high resolution to acquire 
a large number of images for each slide. The 
images are stored in a database and the virtual 
slide can be reconstructed using software that 
puts the single images back together to form 
a whole. 

It was virtual microscopy that led Bacus into 
the world of tissue microarrays. “Several years 
ago, there were no good scanning methods 
for tissue microarrays,’ says James Bacus, the 
company’s president, “but we had the ability to 
scan entire slides.” Using virtual microscopy as 
a starting point, Bacus has developed several 
programs for quantitative, automated TMA 
analysis including TMAscore and IHCscore. 

Several other companies also provide 
automated TMA analysis solutions. Beecher 
Instruments has the TMAx software pack- 
age that relies on morphometric processing 
algorithms and classification rules to auto- 
matically interpret TMAs. And HistoRX in 
New Haven, Connecticut, offers the AQUA 
automated quantitative pathology system for 
biomarker discovery and validation from tis- 
sue samples. 

With TMAs moving FFPE analysis onto the 
fast track of high-throughput analysis, all the 
pieces are falling into place for researchers to 
begin exploring archival FFPE tissue samples. 
The coming years could prove very interesting 
indeed as the molecular secrets of these FFPE 
samples are finally unlocked. a 
Nathan Blow is the technology editor for 
Nature and Nature Methods. 


EXPRESSION PATHOLOGY 


Tissue Issues 


Millions of tissue samples have been collected and archived, but researchers wanting to explore them at 
the molecular level have found it tough going. Nathan Blow investigates the issues. 


According to experts, there are more than a bil- 
lion tissue samples archived in hospitals and 
tissue banks around the world, most of them 
formalin-fixed and paraffin-embedded (FFPE). 
Today, these samples present both an incredible 
opportunity and a huge challenge to research- 
ers. FFPE tissue samples have been extensively 
annotated and well preserved, allowing detailed 
study of the progression of diseases such as 
cancer. But due to the method of preservation, 
obtaining biomolecules from these samples is 
proving difficult, to say the least. 

FFPE was first described more than 100 years 
ago, and most hospitals still use this method 
today. But there has never been a set of stand- 
ardized guidelines for processing FFPE tissue 
samples taken from patients to preserve tissue 
histology, let alone biomolecules. And although 
it works well for histology, the lack of standard- 
ized guidelines seems to have hampered the 
use of FFPE samples in molecular analyses. 
This may soon change, as pathologists are 
working towards standardizing FFPE sample 
preparation, and companies and researchers 
are developing the technology needed to iso- 
late biomolecules and tap into the vast treasure 


chest of archived samples. 
Although FFPE tissue prep- 
aration is simple in theory, 
many problems associated 
with downstream molecular 
applications — such as PCR 
or microarray analysis — can 
arise. “This is all about the fact 
that there has been no attention 
paid to uniformity of prepara- 
tion,” says David Rimm, a 
pathologist at Yale University. 
Between hospitals the time to 


tissue fixation and even the _ tissue analysis. 


David Rimm: time is key in 


enzymes degrading and modi- 
fying both. 

It seems obvious that rapid 
fixation is the answer, but no 
simple solution is in sight. 
Rimm says that for a researcher 
interested in only DNA or RNA, 
rapid fixation using quick freez- 
ing methods, instead of chemi- 
cal fixation, is probably best. 
But if you are interested in pro- 
teins, freezing is problematic as 
the subsequent thawing process 
tends to break up and denature 


3 


method of fixation can vary 
dramatically. 

“I would say that the biggest issue is time 
from ligation of circulation to fixation,” says 
Rimm. During this period of ischaemia, 
molecular changes occur that cause problems 
in obtaining biomolecules. “Phosphorylation 
is very sensitive to ischaemic times. There 
seems to be promiscuous phosphotases in the 
cell that knock phosphates off tyrosines dur- 
ing this period,” says Rimm. DNA and RNA 
can also suffer damage before fixation, with 


proteins and, like ischaemia, 
can lead to modifications. 

Then there are pathologists who think that 
standardizing the time to fixation will prove to 
bea difficult task. “Standardization would be 
great, but I don't know how realistic it is across 
institutions or even within institutions,” says 
Christine Iacobuzio-Donahue, a pathologist 
at Johns Hopkins University School of Medi- 
cine in Baltimore, Maryland. She believes that 
the best place to tackle the problems caused by 
ischaemia is actually downstream of fixation, 


FROZEN IN TIME 


Analysing proteins from tissue 
samples that have been formalin- 
fixed and paraffin-embedded 
(FFPE) can provide critical 
information about how cells 
function before fixation. “When 
you formalin-fix the protein, you fix 
it in time; it is not going anywhere,” 
says Peter Tunon, vice-president of 
sales and marketing at Expression 
Pathology in Gaithersburg, 
Maryland. Researchers are taking 
advantage of this fact to explore 
the protein world in FFPE samples. 


Caught on camera: a section of human muscle 


for broad-based 
screening of 

proteins in FFPE 
samples. Other 
companies also 


all reagents phosphorylation and acetylation, 
are completely can also be observed when 
compatible examining proteins isolated 

with mass- from FFPE samples. PTMs can 
spectrometry be critical to the role of a protein 
instrumentation, in the cell, changing the function 
says Tunon, or localization. “We do see post- 
making it a good translational modifications that 
starting point are preserved on the peptides, and 


they seem to be present in ratios 
that are similar to those we get 
when using fresh frozen tissues,” 
Tunon says of work done by 
Expression Pathology. 

Even much older archival 


Expression Pathology was 
founded in 2001 by researchers 
from the US National Cancer 
Institute and the company Life 
Technologies (now Invitrogen) 
who had experience in studying 
gene expression in tissue and 
histology. “The company was 
founded on the fact that examining 
protein expression is crucial to 
understanding what is happening 
in cells," says Tunon. To this end, 
Expression Pathology has worked 


after dual histochemistry. 


think that protein 


on ways of extracting and isolating 
proteins from FFPE tissue samples 
for analysis by mass spectrometry 
or reverse-phase dot blot 

arrays. The company developed 

a technology that integrates 
extraction of total proteins 

from FFPE samples with tryptic 
digestion so that finished samples 
are ready for mass spectrometry. 
To simplify the system further, 


isolation from 

FFPE tissue samples will provide 
valuable information. QIAGEN 
and EMD Chemicals, both based 
in San Diego, California, now offer 
systems that chemically reverse 
formalin crosslinking to isolate full- 
length proteins for applications 
such as western blotting and 
protein arrays. 

Surprisingly, post-translational 
modifications (PTMs) such as 
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samples of FFPE tissues do not 
seem to pose a problem for protein 
extraction. “We have worked 

with samples that are more 

than 15 years old and found no 
difference in the profiles compared 
to samples that were just a few 
weeks old,” says Tunon. He is quick 
to add that he believes even older 
samples could be examined, but 
Expression Pathology has yet to 
test this idea. N.B. 
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MOLECULAR DEVICES 


after the biomolecules have been isolated. 
“Once you have the samples and have extracted 
the biomolecules you are interested in, then 
you can perform quality control to determine 
if all your samples are similar,’ she says. 

Even if the optimal time to fixation were 
found and standardized, this is only part of 
the problem; researchers still need to identify 
the best fixative to preserve both histology and 
biomolecules. Formalin, the most widely used 
fixative, provides good preservation of tissue 
histology but can cause problems for research- 
ers interested in downstream molecular appli- 
cations. “Formalin tends to crosslink the tissue 
to such an extent that it is hard to get fragments 
of DNA that are longer than, maybe, a hun- 
dred or two hundred base pairs,’ says Rimm. 
Formalin fixation causes the crosslinking of 
DNA as well as RNA and proteins, although 
proteins seem to fair better following formalin 
treatment (see ‘Frozen in time’). 


Preservation society 

Although tissue preservation has its problems, 
the good news is that these are now being dealt 
with. Governmental agencies, such as the US 
Office of Biorepositories and Biospecimen 
Research at the National Cancer Institute 
(NCI) in Bethesda, Maryland, are starting to 
tackle the difficult issue of biospecimen stand- 
ardization. In June, the office released a guide 
to the NCI’s best practices for biospecimen 
resources, detailing technical guidelines for 
NCI-supported biospecimen collection and 
storage. And researchers and companies are 


The DASL assay can be processed on two different platforms: the Array Matrix (left) or the BeadChip. 


now creating methods to use the degraded and 
modified biomolecules obtained from FFPE 
samples for molecular analysis. 

Isolation of RNA or DNA from FFPE tissue 
samples can be accomplished using a number 
of methods or kits. The main problem is that, 
almost without fail, the RNA or DNA isolated 
is degraded and chemically modified. But 
because researchers want to tap into the vast 
archives of FFPE tissues for global expression 
analysis and biomarker discovery, this is spur- 
ring companies to address the issue. 

Microarray analysis has proven a valuable 
tool for understanding global gene-expression 
patterns. But using the degraded messenger 
RNA obtained from FFPE tissues for such anal- 
ysis is problematic. “The results on standard 
microarrays are currently unsatisfactory,’ says 
Shawn Baker, scientific product manager for 
gene expression at Illumina in San Diego, Cali- 
fornia. For this reason, Illumina offers a gene- 
expression application that can study RNA 


extracted from FFPE tissue samples. Called 
DASL, short for cCDNA-mediated annealing, 
selection, extension and ligation assay, this 
system amplifies the mRNAs from FFPE- 
extracted samples, but unlike other amplifi- 
cation systems it is not 3’ biased, says Baker. 
“We use a combination of random and oligo dT 
primers to generate the complementary DNA, 
which means that even with degraded RNA it 
still amplifies quite well and produces good, 
consistent profiles.” Following the first amplifi- 
cation, the DASL system uses two gene-specific 
probes to amplify the cDNA. The resulting 
cDNA can be hybridized to a DASL-specific 
array. “The DASL assay is multiplexed up to 
1,536 genes. So you get a tremendous boost in 
the overall throughput,” says Baker. 

Although unable to survey as many genes 
as standard microarrays, Baker says that using 
the DASL system, researchers have been able 
to profile FFPE tissue samples that are up to 
30 years old — demonstrating the potential to 
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THE CUTTING EDGE 


Researchers have the opportunity 


a slide to a collection vessel. 


to play surgeon — slicing 
and dissecting out specific 
sections of tissue or even cell 
populations — with laser-capture 
microdissection (LCM). This 
level of analysis might seem to be 
difficult when applied to formalin- 
fixed and paraffin-embedded 
(FFPE) tissue samples, but many 
companies are now offering easy 
and quick LCM solutions. 

Leica Microsystems of 
Wetzlar, Germany, offers the 
LMD6000 LCM system, which 
uses an upright microscope for 
dissection and capture. Christoph 
Horlemann, the company’s 
product manager for the 
LMD6000, says this is possible 
because the transport mechanism 
for capture is based on gravity, 
unlike other LCM systems on 
the market. 

‘Transport mechanism’ refers 
to the method for delivering a 
dissected tissue sample from 
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Other approaches to this process 
include that of PALM Microlaser 
Technologies in Bernried, 
Germany, which uses a ‘pressure 
catapult’ to send sections into 
tubes from the LCM instrument, 
and the CapSure system from 
Molecular Devices in Sunnyvale, 
California, which uses the laser to 
extend a polymer onto the tissue 
sample for capture. 

Leica Microsystems’ transport 
mechanism uses slides with a foil 
covering. The tissue is attached to 
the foil covering but not the slide, 
so researchers cut the tissue and 
the foil together and the sample 
simply falls by gravity into the 
collection tube below, explains 
Horlemann. But even this foil 
coating might not be needed in 
the future. 

Both Leica and PALM are 
working with Expression 
Pathology of Gaithersburg, 
Maryland, on what may be 


The Arcturus XT from 
Molecular Devices can 
use either ultraviolet or 
infrared laser light for 
tissue dissection. 


the next generation of LCM 
slides. Called Director, these 
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glass slides are based on laser- 
induced forward transfer (LIFT), 
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examine the numerous collections of archived 
FFPE samples. Illumina is now working to 
increase the number of genes analysed for each 
DASL assay. 

NuGEN Technologies in San Carlos, Cali- 
fornia, is another company developing meth- 
ods to use RNA extracted from FFPE tissue 
samples for gene-expression analysis. NuGEN 
specializes in working with very small amounts 
of RNA or difficult-to-use RNA, such as RNA 
extracted from whole blood or very degraded 
sources. This is commonly seen with clinical 
samples, but is most significant in the case of 
FFPE, says Gianfranco de Feo, senior director 
of customer solutions at NuGEN. Although 
NuGEN did not start offlooking at RNA from 
FFPE tissue samples, it was the next logical step. 
“We have had a product on the market for over 
a year now that allows users to work with very 
degraded RNA in very limited amounts, down 
to 500 picograms. We built on that technology 
to create kits for the much more degraded RNA 
that comes from FFPE samples,’ says de Feo. 

At the core of NuGEN’s technology is its 
amplification and labelling system, which has 
been optimized to work with Affymetrix 3’ 
microarrays. The system relies on a combination 
ofrandom hexamers, similar to that of Illumina, 
augmented with oligo (dT) primers to convert 
mRNA into cDNA in a linear amplification 
process. The inclusion of oligo (dT) primers was 
essential because the Affymetrix arrays probe 
the 3’ ends of transcripts. But NuGEN hopes 
to have labelling and hybridization protocols 
and products for other microarray platforms 
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available before the end of the year. 

To determine how well these degraded RNA 
samples from FFPE tissue will work on micro- 
arrays, NuGEN developed a tool using quanti- 
tative real-time PCR (qPCR) assay. It turns out 
that the results from this assay correlate very 
well with the overall results of microarray analy- 
sis, says de Feo, allowing researchers to decide 
whether the data that could be obtained from 
the array will be of sufficient quality to continue. 
And this is critical information, as in some cases 
less than 50% of the transcripts on the array may 
hybridize with the amplified RNA. 


Think small 
Asuragen in Austin, Texas, is a new company 
on the commercial block, working to under- 
stand and characterize the biological role of 
small RNAs. Although founded only a year 
ago, Asuragen’s RNA roots go much deeper. 
Asuragen is a spin-off of Ambion, a company 
that worked in the field of molecular biology 
with a focus on RNA for nearly 17 years. “At 
Ambion, we developed the first kits and tech- 
nologies for characterizing small RNAs,” says 
Gary Latham, associate director of technology 
development at Asuragen. “And when micro- 
RNAs emerged as a new class of regulatory 
RNAs in humans, we were sitting in an excel- 
lent position to explore this area of ‘biological 
dark matter.” In March 2006, Ambion was 
sold to Applied Biosystems for US$273 mil- 
lion and a portion of those proceeds were used 
to fund Asuragen. 

Asuragen has concentrated its efforts on the 


Leica Microsystems offers the LMD6000 for 
laser capture microdissection applications. 


diagnostic and therapeutic opportunities of 
microRNAs (miRNAs). “As they are smaller, 
miRNAs tend to survive the more tortuous 
conditions of FFPE tissue processing better 
than mRNAs,’ says Latham, which makes 


anon-contact microdissection 
technique that uses a thin energy- 
transfer coating that replaces 
plastic films or adhesives. The 
technology was co-developed 
by researchers at the US 
Naval Research Laboratory in 
Washington DC and scientists 
from Expression Pathology. 

Laser energy is transferred to 
the transfer layer and the layer 
is vaporized. The laser energy is 
then converted into kinetic energy, 
and the selected feature is shot 
instantly into a collection tube. 

LIFT works equally well whether 
the tissue is collected up (PALM) 
or down (Leica) as the energy is 
sufficient to propel the tissue into 
the collection tube either way. 
As the transfer layer completely 
absorbs the laser energy, the 
biomolecules in the sample are 
not affected. The use of glass with 
this coating has other advantages 
as well. 

"Glass slides are quite useful as 


you can also perform fluorescence 
and contrast applications 
without any interfering foil,” says 
Horlemann. 

Molecular Devices, a division 
of MDS Analytical Technologies, 
acquired Arcturus in 2006 
and is now providing both the 
Veritas and the Arcturus XT LCM 
systems. Whereas most LCM 
systems use an ultraviolet laser 
for cutting samples, the systems 
offered by Molecular Devices 
can have either ultraviolet or the 
standard infrared laser options. 

“The infrared laser is ideal for 
small areas where a user is looking 
to pick up only a cell or two,” says 
Steven Blakely, product manager 
for the Arcturus LCM systems. 
“Infrared allows for a gentle 
collection of cells.” 

The use of infrared laser light 
is critical to the CapSure system 
because the dye found in the 
CapSure polymer is activated and 
becomes adherent with infrared 


light. Blakely says the CapSure 
system is particularly useful for 
FFPE samples that are already 
mounted on glass slides because 
the transport mechanism involves 
only the polymer adhering to the 
tissue and so can work with any 
glass slide. 

Cutting tissue samples with 
ultraviolet light offers advantages 
too, such as dissecting thicker 
samples. Leica has worked 
to optimize objectives for its 
LCM systems with high-energy 
ultraviolet-light transmission, 
allowing more power to come 
in direct contact with the tissue 
sample. Horlemann says this 
provides faster cutting of the 
thicker tissue samples while using 
less power. 

The future of LCM might just lie 
in automation and increases in 
throughput. “Now, researchers 
want to do faster, automated 
microdissection for proteomics,” 
says Horlemann. But before 
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anything else, some hardware 
and software issues needed to 
be resolved. 

Any automated software 
package has to control all 
microdissection steps including 
focusing the microscope, 
recognizing cells of interest, 
focusing the laser and defining 
the area to cut — definitely 
not a simple task, but one that 
companies have worked on and 
made tremendous strides in 
recent years. Horlemann says 
that new advances are making 
automation easier every day. 

He points to the Director slides, 
which can allow for contrasting 
methods and fluorescence, 
making it easier when attempting 
to automatically define cells of 
interest as astep in the right 
direction. 

Although simpler now, further 
advances will be required before 
LCM becomes as easy as pushing 
a button and walking away. _—N.B. 
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COMPANY PRODUCTS/ACTIVITY LOCATION URL 

Gene expression analysis 

Affymetrix SNP genotyping: microarray analysis Santa Clara, California www.affymetrix.com 
Aglient Microarray; mass spectrometry; lab-on-a-chip methods Wilmington, Delaware www.aglient.com 

Applied Biosystems Real-time PCR; SNP genotyping; microarrays; microRNA analysis Foster City, California www.appliedbiosystems.com 
Asuragen miRNA-based applications and therapeutics; cancer detection; pharmacogenomics Austin, Texas www.asuragen.com 
BioDiscovery Microarray analysis software provider El Segundo, California www.biodiscovery.com 
Bio-Rad Laboratories Real-time PCR instruments and accessories; microarray accessories Hercules, California www.bio-rad.com 
Ciphergen Biomaker identification for drug discovery and molecular imaging of targets Fremont, California www.ciphergen.com 
Chemicon PCR kits Temecula, California www.chemicon.com 

EMD Chemicals Kits for extraction of proteins from FFPE tissue samples San Diego, California www.emdbiosciences.com 
Eppendorf DualChip DNA microarrays Hamburg, Germany www.eppendorf.com 


Expression Analysis 

GE Healthcare 

Genomic Solutions 

Idaho Technology 

Illumina 

Integrated DNA Technologies 


Invitrogen 


NuGEN 

Operon Biotechnologies 
Orchid Cellmark 

Oxford Gene Technology 


Panomics 


Primera Biosystems 


Microarray-based genomics services for clinical trials 


Microarrays; sequencing; SNP genotyping 


Assay assembly 


for SNP genotyping, sequencing, PCR; microarray printing 


RapidCycler for real-time PCR 


SNP analysis; microarrays; DNA sequencing; DASL system 


Custom oligonucleotides for microarrays 


Microarray accessories; array comparative genomic hybridization kits; RT PCR; gene-expression 
analysis services 


RNA isolation and analysis 


Preprinted oligonucleotide arrays and probe sets 


SNP analysis for degraded samples; forensic analysis 


Oligonucleotide arrays for various applications 


Quantitative gene-expression measurements using QuantiGene system; protein and DNA arrays; 


cytokine arrays 


Multiplex RT PCR; microarray 


Durham, North Carolina 
Piscataway, New Jersey 
Ann Arbor, Michigan 
Lake City, Utah 


San Diego, California 
Coralville, lowa 


San Diego, California 


San Carlos, California 
Huntsville, Alabama 
Princeton, New Jersey 


Oxford, UK 


Fremont, California 


Mansfield, Massachusetts 


www.expressionanalysis.com 
www.gehealthcare.com 
www.genomesolutions.com 
www.idahotech.com 
www.illumina.com 
www.idtdna.com 


www. invitrogen.com 


www.nugeninc.com 
www.operon.com 
www.orchidbio.com 
www.ogt.co.uk 


www.panomics.com 


www.primerabio.com 


QIAGEN RT PCR and real-time PCR systems; epigenetics; laser microdissection accessories; RNA- Germantown, Maryland www.qiagen.com 
extraction systems 

SuperArray Real-time PCR; focused microarray; RNAi; gene-expression analysis services Fredrick, Maryland www.superarray.com 

TeleChem International Microarray robots, printers and scanners; microarray services; microarray accessories such as Sunnyvale, California www.arrayit.com 
glass slides 

Laser microdissection systems and accessories 

Bucher Biotech Laser-capture microdissection systems Basel, Switzerland www.bucher.ch 

Carl Zeiss PALM laser microdissector; PALM capture system Berlin, Germany www.zeiss.de 

Erie Scientific Company Microscope slides for use in laser-capture microdissection Portsmouth, New Hampshire www.eriesci.com 


Expression Pathology 
Leica Microsystems 
Molecular Devices 


Molecular Machines and 
Industries 


Slides for laser-capture microdissection applications 


Microscopes, mi 


Laser-capturem 


croscopy software, laser-capture microdissection 


icrodissection system; CapSure capture technology 


Laser microdissection; micromanipulations 


PALM Microlaser Technologies Non-contact laser-based micromanipulation 


Gaithersburg, Maryland 
Wetzlar, Germany 
Sunnyvale, California 


Glattbrugg, Switzerland 


Bernried, Germany 


www.expressionpathology.com 
www.leica-microsystems.com 
www.molecular-device.com 


www.molecular-machines.com 


www.palm-microlaser.com 


Tissue microarray products and suppliers 


AMS Biotechnology 
Aperio 
Bacus Laboratories 


Beecher Instruments 


Biochain 
Bioimagene 
Cybrdi 
Cytomyx 
HistoGeneX 


HistoRx 


Imgenex 


Frozen and paraf 


Virtual microsco 


abrication 


Frozen and paraf 


Tissue microarra 


Tissue bioreposi 


in-based tissue microarrays 


Digital tissue microarray analysis 


py; automated analysis of tissue microarrays; BLISS virtual scanner system 


Manual and automated tissue microarrayers; accessories for microarray construction and 


in-based tissue microarrays; antibodies; DNA extraction kits 


Digital pathology; software for the analysis of tissue microarrays 


ys; Clinical specimens; immunohistochemistry services 


ory; RT PCR cDNA panels; tissue sections, tissue microarrays 


Services for the detection and localization of cell-specific gene expression on paraffin-embedded 


tissue samples 


Automated analysis of immunohistochemistry 


Histo-arrays from human and animal tissue sources; tissue sections; antibodies 
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Abingdon, UK 
Vista, California 
Lombard, Illinois 


Sun Prairie, Wisconsin 


Hayward, California 
Cupertino, California 


Frederick, Maryland 


Lexington, Massachusetts 


Edegem, Belgium 


New Haven, Connecticut 


San Diego, California 


www.amsbio.com 
www.aperio.com 
www.bacuslabs.com 


www.beecherinstruments.com 


www.biochain.com 
www.bioimagene.com 
www.cybrdi.com 
www.cytomyx.com 


www.histogenex.com 


www.historx.com 


www.imgenex.com 
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COMPANY PRODUCTS/ACTIVITY LOCATION URL 
Pathology Devices Semi-automated tissue microarrayer and accessories Westminster, Maryland www.pathologydevices.com 
Petagen Accumax tissue microarrays Seoul, Korea www.petagen.com 


Premier Biosoft International 


Protein Biotechnologies 


Tissue-microarray analysis software for biomarker relatedness 


Tissue-microarray supplier; human tissue cancer lysates 


Palo Alto, California 


Ramona, California 


www.premierbiosoft.com 


www.proteinbiotechnologies.com 


SuperBiochips Human and animal tissue microarrays; immunohistochemistry services Seoul, Korea www.tissue-array.com 
Unitma Tissue microarrays; tissue microarray fabrication systems Seoul, Korea www.unitma.com 

US BioMax Tissue microarrays; tissue banking protein arrays; GPCR ljamsville, Maryland www.usbiomax.com 
Zyagen Tissue samples; tissue microarrays; total RNA; cDNA; tissue section services San Diego, California www.zyagen.com 
Microscopy and imaging 

Andor Technology CCD and intensified CCD camera systems Belfast, UK www.andor-tech.com 


Applied Imaging 
Applied Cytometry Systems 
Applied Scientific 


Biomedical Photometric 


Developed Ariol high-throughput immunohistochemistry system 


Sof 
Au 


ware and cytometry systems 


‘omated systems for microscopy and fluorescence 


Tissue imaging instrumentation; high-resolution whole tissue imaging 


San Jose, California 
Sheffield, UK 
Eugene, Oregon 


Waterloo, Canada 


www.aicorp.com 
www.appliedcytometry.com 
www.asiimaging.com 


www.confocal.com 


Bioptonics Optical projection tomography scanning Edinburgh, UK www.bioptonics.com 

Bitplane Imaris image analysis software Zurich, Switzerland www.bitplane.com 

Chroma Technology Optical filter sets Brattleboro, Vermont www.chroma.com 

Clemex Image analysis software Longueuil, Quebec www.clemex.com 

Fujifilm Imaging systems for array analysis Stamford, Connecticut www.fujimed.com e 
Hamamatsu Photonics Cameras for micrscopy, light sources Hamamatsu City, Japan www.hamamatsu.com 

Kodak Image analysis software; imaging systems Rochester, New York www.kodak.com 

Micro Video Instruments Microscopes and accessories Avon, Massachusetts www.mmvi-inc.com 

Nikon Instruments Microscopes, microsopy analysis software and accessories Melville, New York www.nikoninstruments.com e 
Olympus Micrscopes, analysis software, accessories Center Valley, Pennsylvania www.olympus.com 

Omega Optical Microscopes and filters Brattleboro, Vermont www.omegafilters.com 

Vision Biosystems Automated immunohistochemistry systems and reagents Mount Waverly, Australia www.vision-bio.com 

General 

Beckman Coulter Tools and systems for molecular biology, genomics and proteomics Fullerton, California www.beckmancoulter.com 

BMG Labtech Microplate readers and handling systems Offenburg, Germany www.bmg-labtechnologies.com 
Cambrex Molecular and cell-biology reagents and tools East Rutherford, New Jersey www.cambrex.com 

Epicentre Biotechnologies Enzymes for PCR and RT PCR; DNA and RNA purification Madison, Wisconsin www.epibio.com 

Hamilton Company Robotics for life sciences Reno, Nevada www.hamiltoncomp.com 

Lonza Molecular-biology reagents and systems Basel, Switzerland www.lonza.com 

Merck Chemicals kits and reagents for molecular and cell-biology research Darmstadt, Germany www.merck.de 

Millipore Automated equipment for genomics and proteomics Billerica, Massachusetts www.millipore.com 

MP Biomedicals Reagents and chemicals for research Aurora, Ohio www.mpbio.com 

New Brunswick Scientific Tissue culture; incubators; shakers Edison, New Jersey www.nbsc.com 

New England Biolabs Molecular-biology reagents, kits and enzymes Ipswich, Massachusetts www.neb.com 

Sigma-Aldrich PCR buffers and enzymes; catalogue chemical supplier St Louis, Missouri www.sigmaaldrich.com e 
Stoelting Physiology research equipment, histology equipment Wood Dale, Illinois www.stoeltingco.com 

Takara Bio Reagents, kits and services for genomics and molecular biology Shiga, Japan www.takara-bio.com e 
USB Chemicals and reagents for molecular biology Cleveland, Ohio www.usweb.com e 


@ see advertisement 
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naturejobs 


he drug industry has not been kind to its employees of late. With 

profits falling, pharmaceutical companies have been laying off staff, 

and thousands of scientists have felt the pinch. In January, Pfizer cut 

its work force by about 10%, a loss of 10,000 jobs. In July, AstraZeneca 
announced it would cut 7,600 jobs, again about 10% of its workforce. A few 
days later, Johnson & Johnson announced cuts of 4,800 jobs (about 4% of the 
workforce). The cuts affected employees across the United States and Europe. And 
last week, biotechnology firm Amgen announced lay-offs of between 2,200 and 
2,600 people (12-14% of its workforce) — the company’s first major lay-off. 

Such ‘realignments’ have reverberating effects on the life-sciences job market 
and beyond. Along with scientists, marketing, sales and support staff have lost their 
jobs. Some biotechnology firms, which increasingly rely on drug-firm partners for 
revenue, have been forced to downsize as well. 

Most drug companies blame dried-up drug pipelines. Blockbuster pills such 
as Viagra are increasingly rare finds, and the drugs that are far along in the 
development stages are increasingly hard to get approved for sale. High-profile 
tragedies, such as those related to Merck’s Vioxx, haven't helped matters. 

But there are still jobs out there. As we reported last year (see Nature 439, 
886-887; 2006), companies continue to recruit even in lean years. And the most 
recent numbers from the US Bureau of Labor Statistics suggest that employment 
for chemists and biologists in the US manufacturing sector for pharmaceutical and 
medicine is expected to grow by about 25% by 2014. 

So scientists looking to industry for a livelihood need to do their homework. 
When and where were the company’s last lay-offs? What are the company’s bread- 
and-butter product lines? When do the patents run out? Do generic drugs threaten 
the company’s bottom line? What else is in the pipeline? 


Of course, there's no ignoring the most commonsense question of all — in the 


event of one of those inconvenient realignments, what's my plan B? 
Gene Russo, acting editor of Naturejobs 
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Formic gender disorder 


Divide and conquer. 


Barrington J. Bayley 


The waywardly named formic gender dis- 
order (FGD) is the foremost of a growing 
number of ailments some physicians still 
refuse to recognize. The Encyclopaedia of 
Psychocentric Dysfunctions, compiled by 
Hortense Constantia, MD, in 2025, lists 
it alongside other disorders dismissed 
as imaginary. The first case to come to 
attention was that of Hattie Jacquefils, a 
45-year-old housewife of Rootesville, Lou- 
isiana, who in 2022 ran screaming from 
her house in terror of her husband, a man 
of known placid and loving disposition. 

Mrs Jacquefils was admitted to the psychi- 

atric ward of Rootesville General Hospital 

where she cowered and tried to hide from 
certain members of staff. In the follow- 


ing months a cluster of similar cases © 


occurred in a swathe of countryside 
to the east of the town. 

Determination of the 
aetiology has been slow. 
Initially investigation 
was psychiatric and 
the condition classified 
alongside other phobias 
and obsessive-compulsive 
disorders. Moves to carry 
out physical examina- 
tions were delayed by 
political objections, 
particularly on the part 
ofa feminist faction in the Associa- 
tion of American Psychiatrists. Even now 
sufferers universally withhold consent to 
microbial studies. Only after some years 
was the disorder dignified with a diagnosis 
of somatic pathology as the view gradually 
began to prevail that it is a fungal infection 
of the nervous system. Relevant references 
are appended. 

Federal authorities have now been forced 
to admit that an eastern suburb of Rootes- 
ville is home to the Canine and Feline 
Accidental Injuries Statistical Institute 
(CFAISI), a clandestine micropathology 
research laboratory funded by the Central 
Intelligence Agency. The laboratory had for 
20 years been studying a nervous disease 
affecting the solitary stink ant, a denizen of 
the Western Cameroons. Occasionally the 
stink ant breathes in the spores ofa fungus, 
which then grows inside its body, attacks 
its brain and controls its behaviour. Under 
the influence of the fungus the stink ant, 
for the first time in its life, climbs the stem 
of a plant, sinks its mandibles into it and 
clings there until it is consumed by the fun- 


gus, having gained sufficient height from 
which to spread fresh spores. 

A behaviour-controlling infectious dis- 
ease has obvious political value. Formic 
gender disorder was named in honour of 
the African solitary stink ant by Doctors 
Humbold Gresgud and Ebenezer Schwartz, 
both workers at the CFAISI. The initial 
swathe of infection did, it is true, radiate 
from a point coinciding with the location 
of the institute, and the laboratory where 
Doctors Gresgud and Schwartz pursued 
their research is situated on the tenth floor 
of the building, an ideal height from which 
to spread a plume of windblown 

spores over the cotton planta- 
tions east of Rootesville. 


A comprehensive history of the disease 
should mention that Doctors Gresgud 
and Schwartz were allegedly infamous for 
their unbending misogyny. It should also 
be stated that they both vehemently denied 
being responsible for the epidemic, claim- 
ing that they merely identified the cause of 
the disease where other medical laborato- 
ries had failed. Both refused to voice any 
sympathy for the epidemic’s victims. 

Symptoms are insidious at onset, but 
progressive. The early phase resembles 
Tourette’s syndrome, the patient being 
unable to refrain from outbursts of extrav- 
agant language. Male sufferers first mutter 
then shout with increasing frequency: “Fie, 
fie, fickle female!” Female victims in simi- 
lar manner splutter: “Bastards! Bastards! 
Bastards!” The preliminary phase typically 
lasts three days. A vehement revulsion for 
the opposite sex then sets in, both psy- 
chologically and physically, developing in 
not less than 48 hours into a dread which 
equals the intensity of rabid hydrophobia. 
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Once the early phases are gone through 
sufferers are able to function normally pro- 
vided they meet with no visual, auditory, 
olfactory or verbal stimuli reminding them 
of the opposite sex. 

The disorder has defied attempts at 
treatment. Segregation of sufferers is the 
only recourse. The rapid spread of the dis- 
ease is surprising. It can be transmitted by 
skin contact, sneezing or even by breathing 
heavily in the same room. It also comes as 
a surprise that the sexual link binding the 
male and female sides of the human race 
can be so drastically severed. New forms 
of social organization have perforce arisen 
in those districts where the epidemic has 
taken hold, leading to the emergence of a 
new study in the field of sociology. 

For all that, it is necessary to inter- 


v2 ject a professional note. Criticism 


must be levelled at those who step 
outside medical bounds and spread 
alarmist prognoses, stating 
that should it come about 
that males and females 
no longer reproduce 
in partnership they are 
henceforth, by definition, 
separate species; predicting 
the division of the planet 
into male and female 

hemispheres, and so 

forth. It is admittedly 

fortuitous that non- 
sexual methods of reproduction are 
now available. But it should be stressed 
that medical science has faced greater chal- 
lenges than simple gender phobia before 
now. If formic gender disorder introduces 
any novelty into the medical record it is 
that this is the first contagious disease in 
history whose sufferers unanimously pro- 
claim it an inestimable boon to human- 
kind. 


References 

Baker, C. N. M. Bacteria and fungal spores in neural 
tissues of east-of-Rootesville field rodents. Quarterly 
Journal of Immunology (spring 2030). 

Department of Neuropathology, Fremen Hospital, San 
Francisco. Cellular examination of formic gender 
disorder cadaver. Quarterly Journal of Immunology 
(fall 2030). 

Department of Neuropathology, Fremen Hospital, San 
Francisco. Aberrant dendritic growth and its agents. 
Neurological Science Review (spring 2031). 


Submitted by: John Barrington, MD. m 
Barrington J. Bayley's science-fiction 
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